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AN INTRODUCTION TO INEQUALITIES 


Preface 


Mathematics has been called the science of tautology; that is to 
say, mathematicians have been accused of spending their time prov- 
ing that things are equal to themselves. This statement (appropriately 
by a philosopher) is rather inaccurate on two counts. In the first 
place, mathematics, although the language of science, is not a science. 
Rather, it is a creative art. Secondly, the fundamental results of 
mathematics are often inequalities rather than equalities. 

In the pages that follow, we have presented three aspects of the 
theory of inequalities. First, in Chapters 1, 2, and 3, we have the 
axiomatic aspect. Secondly, in Chapter 4, we use the products of the 
preceding chapters to derive the basic inequalities of analysis, results 
that are used over and over by the practicing mathematician. In 
Chapter 5, we show how to use these results to derive a number of 
interesting and important maximum and minimum properties of the 
elementary symmetric figures of geometry: the square, cube, equilat- 
eral triangle, and so on. Finally, in Chapter 6, some properties of dis- 
tance are studied and some unusual distance functions are exhibited. 

There is thus something for many tastes, material that may be 
read consecutively or separately. Some readers will want to under- 
stand the axiomatic approach that is basic to higher mathematics. 
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They will enjoy the first three chapters. In addition, in Chapter 3 
there are many illuminating graphs associated with inequalities. 
Other readers will prefer for the moment to take these results for 
granted and turn immediately to the more analytic results. They will 
find Chapter 4 to their taste. There will be some who are interested 
in the many ways in which the elementary inequalities can be used 
to solve problems that ordinarily are treated by means of calculus. 
Chapter 5 is intended for these. Readers interested in generalizing 
notions and results will enjoy the analysis of some strange non- 
Euclidean distances described in Chapter 6. 

Those whose appetites have been whetted by the material presented 
here will want to read the classic work on the subject, /nequalities, 
by G. H. Hardy, J. E. Littlewood, and G. Pélya, Cambridge Univer- 
sity Press, London, 1934. A more recent work containing different 
types of results is Inequalities, by E. F. Beckenbach and R. Bellman, 
Ergebnisse der Mathematik, Julius Springer Verlag, Berlin, 1961. 


E. F. B. 
R. B. 


Santa Monica, California, 1960 


CHAPTER ONE 


Fundamentals 


1.1 The “Greater-than” Relationship 


You will recall that the symbol “>” means “greater than” or “is 
greater than.” Then you can readily answer the question: Is 3 > 2? 
Of course it is. 

But is —3 > —2? Admittedly, —3 is a “greater negative number” 
than —2, but this statement does not answer what is meant by the 
question. If the real numbers (zero and the positive and negative 
rational and irrational numbers) are represented geometrically in the 
usual way by points on a horizontal number scale directed to the 
right, as indicated in Fig. 1.1, then the numbers appear in order of 


to ttt tt tt Ott 
“$§ -4 #3 -2 -l 1°) ' 2 3 4 5 
Figure I.1. A real-number scale 


increasing value from left to right. The point representing —2 ap- 
pears to the right of the point representing —3, and accordingly 
—2 > —3. Similarly, 


(iycas<4. 352, 0-9) af 2. cio: 


Hence we have the following geometric rule for determining 
inequality: Let a and b be any two real numbers represented by points 
on a horizontal number scale directed to the right. Then a > b if and 
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only if the point representing a lies to the right of the point representing b. 

You can say that —3 > —2, or that —300 > —2, but it is not 
true according to the foregoing geometric rule. 

In dealing with inequalities, it is often more fruitful and even nec- 
essary to work algebraically instead of graphically. The geometric 
rule given above has, in terms of the basic notion of positive number, 
the following simple algebraic equivalent: 


DEFINITION. Let a and 6 be any two real numbers. Then a > db if 
and only if a — b is a positive number. 


Thus, if a = —2 and b = —3, thena — b = —2 — (—3) = 1 is 
positive. Hence —2 > —3, as noted above in the geometric discus- 
sion. You might check the inequalities in (1.1) by the present alge- 
braic method of subtraction and verify each of the following inequa- 
lities both by the geometric method and by the algebraic method: 


#33, 250, 15-29, -<@sT, -}>-40. 


1.2 The Sets of Positive Numbers, Negative Numbers, and Zero 


You will note that in the preceding section we defined the inequality 
a > b in terms of positive numbers. The set P of positive numbers, and 
similarly the set N of negative numbers, as well as the special set O 
having as its only member the number 0, play essential roles in the 
study of inequalities. In fact, while of course we shall freely use the 
familiar algebraic (field) properties of the real number system, such 
as the commutative, associative, and distributive laws, a basic thesis of 
this entire tract is that all order relationships in the real number system 
—all algebraic inequalities—can be made to rest on two simple axioms 
regarding the set P of positive numbers. These axioms will be presented 
in the following section. 

Symbolically, for “a is positive” we write “ae P,” read in full as 
“a is a member (or element) of the set P.” Thus we have Se P, 0€ 0, 
—3eN. 

Let us look briefly at the foregoing sets, P, N, and O, and their 
members. 

The number zero, of course, is the unique member 0 of the set O; 
it satisfies the equation 


a+0O=a 


for any real number a. 
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Regarding the set N of negative numbers, it is important to dis- 
tinguish the idea of the negative of a number from the idea of a nega~ 
tive number: 

The negative of a number a is defined to be the number —a such 
that 


(a) + (—a) = 9. 


Thus, if a = —3 then the negative of a is —(—3) = 3, since 
(—3) + (3) = 0. Similarly, if a = 0 then —a = 0 since0 +0 =0. 

A negative number is defined to be the negative of a positive num- 
ber. Thus, you recognize 3, 1/2, 9/5, 7, \/2 as being members of the 
set P of positive numbers. Then —3, —1/2, —9/5, —a, —\/2 are 
members of the set N of negative numbers. 

We shall not attempt to define the basic notion of a positive num- 
ber, but shall now proceed to characterize these numbers by means 
of two basic axioms. 


1.3 The Basic Inequality Axioms 


The following simple propositions involving the set P of positive 
numbers are stated without proof; accordingly, they are called 
axioms. It is interesting to note that they are the only propositions 
needed, along with the familiar algebraic structure of the real num- 
ber system,} for the development of the entire theory of inequalities. 


Axiom I. [fais areal number, then one and only one of the following 
statements is true: a is the unique member 0 of the set O; a is a mem- 
ber of the set P of positive numbers; —a is a member of the set P. 


AxIOM II. [fa and b are members of the set P of positive numbers, 
then the sum a + b and the product ab are members of the set P. 


The three alternatives listed in Axiom I relate an arbitrary real 
number a and its negative —a as follows: If a is zero, then —a is 
zero, as already noted; if a is positive, then —a is negative by the fore- 
going definition of negative number; and if —q is positive, then 
a = —(—a) must be negative, again by the definition of negative 
number. Thus a and —a are paired in the sets P, N, and O as indi- 
cated in Table 1. 


¢ But see the footnote on page 12. 
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TABLE |. Pairings of Numbers and Their Negatives 


Number 


In the geometric representation (Fig. 1.1), the points representing 
a and —a either coincide at the point representing 0 or lie on oppo- 
site sides of that point. 


1.4 Reformulation of Axiom I 


Axiom I is concerned with the set P of positive numbers, and the 
inequality a > b was defined in terms of the set P. Let us reformu- 
late this axiom in terms of the inequality relationship. 

If a and 6 are arbitrary real numbers, then their difference, a — 5, 
is a real number; accordingly, Axiom I can be applied to a — b. 
Thus either (a — b)e O (that is, a = b), or (a — b)e P (that is, a > 5), 
or —(a — b) = (b — a)e P (that is, b > a), and these three possibili- 
ties are mutually exclusive. Hence, the following statement is a con- 
sequence of Axiom I: 


Axiom I’. Ifa and b are real numbers, then one and only one of the 
following relationships holds: 


ax=b, a>b, b> a. 


In particular, Axiom I’ asserts, in the special case b = 0, that if a 
is a real number then exactly one of the following alternatives holds: 
a = 0 (that is, ae O), or a> 0 (that is, ae P), or 0 >a (that is, 
—ae P). Accordingly, Axiom I can be deduced from Axiom I’. 

If a statement S can be deduced from—i.e., is a consequence of — 
a statement T, we say “7 implies S.” We have just seen that Axiom I 
implies Axiom I’ and also that Axiom I’ implies Axiom I. If each of 
two statements implies the other, we say that they are equivalent. 
Thus, Axioms I and I’ are equivalent. 

To illustrate Axioms I and I’, consider the numbers a; = 3, 
a2g2= —4,b,=0,b2 =3. 

Illustrating Axiom I, you note that a,eP, —azeP, b,eO, and 
ba P; you note also that a; ¢ O (read “a; is not a member of the 
set O”), and —a,¢P, etc. 
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Illustrating Axiom I’, you have 


a,—b,;=3-0=3, a4—b,>0, a> by; 
a, —be =3 -—3 =0, a,—b,=0, a; = be; 
a2 —-b, = —4—0= —4, bj—a2>0, by > a2; 
a2 — bg = —4-—3= —-7, be — a, >0, be > ae. 


You note, then, that in each of the four instances one and only 
one of the three relationships given in Axiom I’ holds. This illustra- 
tion of Axiom I’ will be continued in the following section, as addi- 
tional inequality relationships are introduced. 


1.5 Additional Inequality Relationships 


In place of an inequality such as b > a, you might equally well 
write a < b, read “a is less than b.” The two inequalities are entirely 
equivalent and neither is generally preferable to the other. In the 
foregoing illustration of Axiom I’, the sign “>” was used throughout 
for the sake of consistency. But you might just as well have consid- 
ered it preferable to be consistent in writing the a’s before the 6’s in 
all the relationships. Then you would have had 


(1.2) %>bh, @=be2, ach, az<be. 
Likewise, 


—4<4, 2<3, —2<0, —-2<-I, 0<i, 
3<m7, 0<¢2, -9<1, 1< 2, -40< -f. 


The symbols “>” and “<” represent strict inequalities. 

Two other relationships considered in the study of inequalities are 
the mixed inequalities a > b and a < b, read “a is greater than or 
equal to b” and “a is less than or equal to b,” respectively. The first 
of these, a > b, means that either a >b or a= b; for example, 
3 > 2 and also 2 > 2. The second, a < b, means that either a < b 
or a = b; thus 1 < 2 and also 2 < 2. 

In (1.2) it is stated that one of the three relationships listed in 
Axiom I’ holds in each instance. But the axiom itself asserts further 
that only one of the relationships holds. Therefore, to make the illus- 
tration of Axiom I’ complete, you really should add the statements 


(1.3) aEh, aEb,, aph, azh be, 
read “a, is neither less than nor equal to 5,,” etc. 
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You naturally feel that these negative statements in (1.3) are su- 
perfluous, and indeed no one would claim that you should ordinarily 
write them down to complete the information contained in (1.2). This 
is due to the fact that the exclusiveness principle—the “one and only 
one” aspect—of Axiom I or I’ is taken for granted. 

Because of the exclusiveness principle, the respective relationships 
in (1.2) and (1.3) clearly are equivalent; that is, each implies the 
other. Nevertheless, the negation of an inequality is often a very use- 
ful concept. 

If you tend to confuse the two symbols “>” and “<<”, you might 
note that in a valid inequality, such as 3 > 2 or 2 < 3, the larger 
(open) end of the symbol is toward the greater number while the 
smaller (pointed) end is toward the /esser number. 


1.6 Products Involving Negative Numbers 


What sort of number is the product of a positive number and a 
negative number? Or the product of two negative numbers? We can 
use Axioms I and II and some of their consequences to determine 
the answers to these questions. 

If ae P and be N, then —be P according to Table 1, so that the 
product a(—b)« P by Axiom II. Hence —{a(—b)]« N by the defini- 
tion of a negative number; but —{a(—)] = ab by the usual algebraic 
rules for interchanging parentheses and minus signs: 


—(a(—b)| = —[—(ab)] = ab. 
Therefore abe N, and thus we have the following result: 


THEOREM 1.1. The product ab of a positive number a and a negative 
number b is a negative number. 


Similarly, ifae N and be N, then —ae P and —be P according to 
Table 1. Hence, by Axiom II, their product (—a)(—b)e P. But by 
the rules of algebra, (—a)(— 6) = ab, and therefore abe P. Hence we 


get this result: 
THEOREM 1.2. The product ab of two negative numbers a and b is a 
positive number. 


In particular, by this last result and Axiom II, the square of any 
real number other than zero is a positive number. Of course, 0? = 0. 


FUNDAMENTALS iB 


Thus we obtain one of the simplest and most useful results in the 
entire theory of inequalities: 


THEOREM 1.3. Any real number a satisfies the inequality a? > 0. 
The sign of equality holds if and only ifa = 0. 


1.7 “Positive” and “Negative” Numbers 


By this time, you may realize the strength of Axioms I and II. You 
might be amused to learn that from them you could even determine 
which of the nonzero real numbers belong to the set P of positive 
numbers and which belong to the set N of negative numbers—as if 
you didn’t already know! 

To see this, let us for the moment put “positive” and “negative” 
in quotation marks to indicate information obtained from the axioms. 

Let us start with the number a = |. Since a= 0, it follows from 
Theorem 1.3 that a? > 0. Thus a? is “positive.” But 


so that I is “positive.” 

Next, let us try a = 2. Since we have now determined that 1 is 
“positive,” since | + 1 = 2, and since by Axiom II the sum of two 
“positive” numbers is “positive,” it follows that 2 is “positive.” 

Now let a = 4; then 2a = 1. Thus the product of the “positive” 
number 2 and the number a is the “positive” number 1. But if a were 
“negative,” then the product of 2 and a would be “negative,” by 
Theorem 1.1, Therefore a = 4 must be “positive.” 

Thus the numbers 1, 2, 4 are “positive” and hence, by Table 1, the 
numbers —1, —2, —4 are “negative.” 

Continuing, we can show that the integers 3, 4, etc.; the fractions 
4, 4, etc.; and the fractions 4, 4, 3, 3, etc., are “positive,” and accord- 
ingly that —3, —4, ~4, etc., are “negative.” Thus, for any nonzero 
rational number we can determine whether it is “positive” or 
“negative.” 

Finally, the limiting processes that are used in defining irrational 
numbers can be applied to determine, from our knowledge as to 
which rational numbers are “positive” and which are “negative,” 
whether a given irrational number is “positive” or “negative” in the 
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completely ordered field of real numbers.} We shall not discuss irra- 
tional numbers in any detail in this book; for an interesting treatment 
of them, you should read Numbers: Rational and Irrational, by Ivan 
Niven, also in this New Mathematical Library series. 


Exercises 


1. Make a sketch showing points representing the following numbers on a 
horizontal number scale directed to the right: 


3, -1, 0, -15, 7-3, 3-7, V2, 2, -2, —3. 
Rewrite the numbers in increasing order, giving the result as a continued 
inequality of the form a <b <c, ete. 


2. Put a stroke through e, thus ¢, if the statement is false: 


(a) —3eN, (f) azeN, 
(b) OeP, (g) (a2 + leP, 
(c) SeO, (h) —27eP, 
(d) V2eN, (i) (a2 + 1)€0, 
(ec) (m7 —3)eP, (j) —3eP. 
3. Fill in each blank with P, N, or O so that a true statement results: 
48 _ 49 2 
(a) 53 — aay * ; (f) 7? — 4(2)(6)e___., 
721 721 
(o) oy ~ yar’ , (g) 93(72 + 4) — 93(72)e j 
(c) 3 = SPe t (h) 93(72 — 4) — 93(72)e : 
— 23 ~23 . 2+3_1f2 
@) 7 -43° aS 3+ 
@ =-t-——. () (-3P 3. 


+ Here ordered means that Axioms I and II are satisfied, and complete refers to the 
basic property that, if a non-null set of real numbers has an upper bound, then it has 
a least upper bound. For example, the set {1, 1.4, 1.41,...} of rational approximations 
to \/2 is bounded from above by 2, or by 1.5; hence it has a least upper bound (which 
we denote by 1/2). The corresponding point on the number scale (see page 5) divides 
that scale into a left-hand and a right-hand portion. Since there is at least one “posi- 
tive” rational number—e.g. 1 or 1.4—in the left-hand portion, we say that \/7Z is 
“positive.” In Sec. 1.7 we have shown that the rational numbers can be ordered in only 
one way, and we have stated that, similarly, the real numbers can be ordered in only 
one way. The property of complete ordering, or its equivalent, is used in defining the 
teal numbers in terms of the rational numbers, and accordingly it is taken as a postu- 
late for the real numbers rather than as an Axiom III for inequalities. 
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4. Fill in each blank with >, <, or = so that a true statement results: 


48 49 
(a) 33 — 373” (f) 7? 42)(6), 
721 721 
(b) 337 —— BR” (g) 93(72 + 4) 93(72), 
—23 —25 = 
(©) 3 (h) 9372 — 4) 93(72), 
-—23 —23 1 243 Vf2,3 
Ve © 445 (G+ 3) 
ea pa on j) (-3P _-__ 8. 
© =— +. () (-3) 
5. Label T for true or F for false: 
(a) -22> -3 : (fF) -1<2 ay 
b) Oo<0 —_, (g) $4, “ey 
¢«). 0><1 ——, (h) -$2-+ ——, 
©) -4<-4$ 5 () 1<0 


6. Write the negative of each of the following numbers: 
a - — 1)? a 2 
2, 3-7, G7), aa 0, Vb? — 4ac. 


7. Fill in the blanks to give affirmative relationships equivalent to the 
stated negative relationships: 


(a) afd, a b, (d) afb, a b, 
(b) ab, a b, (ec) a}pb, a b, 
() a}b, a b, (f) ab, a b. 


8. Show that each positive number p is greater than each negative number n. 
9. For two real numbers a and 5, what single conclusion can you draw if 
you can show that a > b and that a < b? 

10. Prove by mathematical induction from Axiom II that if a1, a2,...,a. 
are positive, then the sum a; + a2 + -°+ + a, and the product a1G2 +++ a, 
are positive. (See the comments in Sec. 2.6.) 

11. Use Axioms I and II to show that 3 is a “positive” number. 


CHAPTER TWO 


Tools 


2.1 Introduction 


In dealing with inequalities, the only basic assumptions that are 
ultimately used are the two axioms discussed in Chapter 1, along with 
the real number system and its laws, such as the distributive law, 
mathematical induction, etc. Still, there are several simple theorems 
that derive from these axioms and that occur so frequently in the 
development and application of the theory that they might almost 
be called “tools of the trade.” 

These theorems, or operational rules, and their proofs are attrac- 
tive and interesting on their own merits. Moreover, they furnish an 
excellent illustration of the way mathematicians build up an elabo- 
rate system of results from a few significant basic notions and as- 
sumptions. The proofs are usually short, but nevertheless complete; 
and in just a few places they call for the touch of ingenuity that makes 
mathematics the fascinating subject that it is. 

In the present chapter, some of these theorems are listed, illus- 
trated, and proved. The letters a, b, c, etc., that occur in the state- 
ment of the theorems will be understood to represent real numbers, 
arbitrary except for explicitly given constraints. 

For convenience, the theorems, or rules as we shall sometimes call 
them, will be stated here only for the “>>” case. In each instance, 
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there is an equivalent “<” rule. Thus, to the “>” rule for transi- 
tivity, “If a> 6 and b> c, then a>c,” there corresponds the 
equivalent “<” rule, “Ifa < bandb<c, thena<c.” 

You can similarly construct the “<” rule that is the companion to 
each of the other “>” rules given in this chapter, but watch out for 
mathematical pitfalls! If a rule involves a positive multiplier, say 
c > 0, and requires that the difference of two arbitrary quantities be 
positive, then the companion “<” rule still has c > 0 (or 0 < cif 
you prefer), not c < 0. Thus the companion rule to “If a > 6 and 
c > 0, then ac > be” is “Ifa << b and c > 0, then ac < bc.” 

The statement of the theorem at the beginning of each section is 
divided into two paragraphs. The simpler first paragraph, dealing 
with the strict “>” inequality sign, contains the heart of the result. 
The second paragraph involves the mixed “>” inequality sign, and 
in addition it sometimes includes the case of an arbitrary number n 
of real values. Thus it deals with a more general and inclusive situa- 
tion. The proof is usually given for the more inclusive general case 
only, but it can readily be specialized to apply to the first paragraph. 

The illustrations of the theorems given in this chapter will some- 
times involve the stated “>” rule, and sometimes the implied “<” 
rule. 


2.2 Transitivity 


THEOREM 2.1. If a>b and b> c, thena>c. 
More generally, if a, > a2, d2 > 43,.-.,@n-1 2 Qn, then a, > an, 
with a, = a, if and only if all the a’s are equal. 


Thus, if a consideration of your personal expenditures has led you 
to observe that you spend more money on Saturday than on any 
weekday, and that you spend at least as much on Sunday as on Sat- 
urday, then you can conclude that you spend more money on Sun- 
day than you do on any weekday. 

Again, the solution of Exercise 1 in Chapter 1 is 


—3<¢ -2< -IS< —1<3-—-7<0< 7-3 VW2<2<3; 


this might be interpreted narrowly as meaning only that each of the 
first nine members of the set is less than the immediately succeeding 
number, thus: —-3< —2, —2< —155, etc. If so, it still implies, by 
the foregoing transitivity rule, that each of these numbers is less than 
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any succeeding number; for example, —3 << —1.5. -—1<2, 


3-ar< V2. 


PRooF. The transitivity rule could be proved by simple mathe- 
matical induction (for which see the comments in Sec. 2.6). But for 
this first rule, we shall give a single direct proof involving four real 
numbers. 

Suppose, then, that a; > a2, a2 > a3, a3 > a4. By the algebraic 
definition of inequality, each of the quantities a; — az, a2 — a3, 
a3 — @g is either in the set P or in the set O. Therefore the sum 


(a1 — a2) + (a2 — a3) + (43 — a4) = ay — Og 


is either in P, by Axiom II, or in O; and it is in O if and only 
if ay — @g = 0, ag — a3 = 0, a3 — ag = 0. Thus a; > ay, and the 
sign of equality holds if and only if a; = a2, a2 = a3, a3 = a4. 

The proof for the general case is left as an exercise. 


2.3 Addition 


THEOREM 2.2. If a>b and c>d, then at+c>b+d. If 
a> b, and c is any real number, then a+c>b+c. 
More generally, if a, > b1,a2 > be,...,@n > bn, then 


(2.1) Ay + Ag tere + Ay > by + bo + oe+ + dy. 


The sign of equality holds in (2.1) if and only if a, = by, az = be,..., 
an, = bn. 


Thus, if the inequalities 1 << \/2 and 3 < 7 are added, they yield 
1+3< 247. This last inequality combined with —1 = —1 
gives 3< \/2+ 7-1. And all five of these relationships added to- 
gether produce 10 < 3(\/2 + 7) — 2. 


ProorF. As in the case of transitivity, an inductive proof is again 
available. This time, however, a direct proof of the general case will 
be given. Since by hypothesis each of the quantities a, — b,, 
az — bg,...,an — bn & P or € O, then by the generalization of Axiom 
II given in Exercise 10 of Chapter 1, the sum (a; — by) + (a2 — be) + 
see + (Gn — bn) = (€1 + az + +++ + Qn) — (01 + bo +++ + bade P, 
unless it e O with a, — b; = 0, ag — be = O,...,an — b, = O. Thus 


Qy + Gz +++ + An > by + bo + +++ + dn, 


and equality holds if and only if ay = 6, ag = be,..., an = Dy. 
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2.4 Multiplication by a Number 


THEOREM 2.3. If a>b and c>0, then ac> be. If a>b and 
c<0, then ac< be. 

More generally, if a> b and c>0, then ac > be, with ac = be 
if and only if a=b. If a>b and c<0, then ac< be, with 
ac = be if and only ifa=b. 


Thus, multiplying the terms of an inequality by a positive number 
leaves the sign of inequality unchanged, but multiplying by a nega- 
tive number reverses it. In particular, for c = —1, if a> then 
—a<—b. 

For example, if you multiply 3 >2 by 1 and —1, you obtain 
3>2 and —3 < —2, respectively. 


PROOF. It is given that a>, so that a—beP or €O. If ceP, 
then it follows from Axiom II that c(a — b) = ca — cheP or €O; 
that is, ca > cb. 

But if ce N, then it follows from Theorem 1.1 that c(a — b)eN or 
€O; hence, —{c(a — b)] = cb — cae P or € O, So that cb > ca. 

In either case, the sign of equality holds if and only ifa = b. 


2.5 Subtraction 


THEOREM 2.4. If a>b and c>d, then a~d>b—c. If 
a> b, and c is any real number, then a—c>b—c. 

More generally, if a>b and c>d, thna—~d>b-—c, with 
a—d=b-—c ifandonly ifa=b andc=d. 


Note that dis subtracted from a, and c from b—not c from a, or 
d from b. 

Thus, by subtraction, the inequalities 7 >6 and 5 >3 yield 
7—3>6-—5, that is, 4> 1; but the inequality 7~—-5>6—-—3 
is false. Or, to illustrate the rule in terms of “<”, the inequalities 
-—~5< 10 and —4< —3 yield —5 — (—3) < 10 — (—4); that is, 
-2< 14. 


Proor. Applying the rule for multiplying an inequality by a nega- 
tive number (Theorem 2.3) to c > d, we get —c < —d, that is, 
—d> —c, where the sign of equality holds if and only if c=d. 
Now applying the rule for adding inequalities to a > 5 and 
-—d> -c, we obtain a + (—d) > b+ (-c), or a—d>b—c, 
with a ~ d= b — cif and only ifa = bandc=d. 
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Exercises 


1. Show that ifa <b thena< }(a+b)<b. 
2. Show that 
(a? — b?)(c? — d?) < (ac — bd)? 
and 
(a2 + b2y(c? + d2) > (ac + bd)? 
for all a, b, c, d, and that the signs of equality hold if and only if ad = be. 
3. Show that 
(a? — b)? > 4ab(a ~ b)? 
for all a, 6, and that the sign of equality holds if and only if a = 6. 
4. Prove the general “>” transitivity rule by mathematical induction. 


2.6 Multiplication 


THEOREM 2.5. If a>b>0 and c>d> 0, then ac> bd. 
More generally, if ay > b, >0,a2 > be >0,...,an > bn > 0, 
then 


(2.2) Q1Q2+°+Qn > bybz +++ by. 


The sign of equality holds in (2.2) if and only if ay = by, a2 = be,..., 
ay, = bn. 


Thus, from 2 > 1 and 4 > 3 you obtain (2)(4) > (1)(3), or 8 >3. 
But note that —1 > —2 and —3 > —4, yet (—1) —3) << (—2xK(—-4); 
accordingly, the requirement that the numbers be positive cannot be 
dropped. 


PROOF. We shall give a proof by mathematical induction. The 
standard procedure in constructing such a proof consists of the fol- 
lowing steps. First, the statement to be proved for all positive 
integers n is tested for the first one or two; then, under the assump- 
tion that the statement is true for all integers up to and including a 
certain one, say k — 1, it is proved that the statement is true also 
for the next integer, k. Since k can be any integer >1 (in particular, 
let kK — | = 1 or 2, for which the statement was verified), we may 
conclude that the statement is indeed true for all positive integers. 

For n= 1, the conclusion a; > b; of Theorem 2.5 is simply a 
repetition of the hypothesis. This observation is sufficient for the first 
step in the proof by mathematical induction, but we shall give the 
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proof also for n = 2; that is, we shall show that if a, > b, > 0 and 
a2 > be > 0, then aQ1Q2 > bybe. 

The inequality 


(2.3) 402 > byiae 


follows from the rule for the multiplication of an inequality by 
a positive number; the sign of equality holds if and only if a; = by. 
The inequality 


(2.4) bya2 > bybe 


follows from the same rule; the sign of equality holds if and only if 
az = bz. Now, the desired inequality 


(2.5) 4142 > byb2 


follows from (2.3) and (2.4) by the transitivity rule (Theorem 2.1); 
the sign of equality holds in (2.5) if and only if it holds in (2.3) and 
(2.4), that is, if and only if a; = b; and az = by. 

We have demonstrated that the inequality (2.2) holds for n = 1 
andn=2. 

Assume that the inequality (2.2) is true form = 1,2,...,k—1, 
that is, for products of A — 1 numbers: 


(2.6) yd +++ ay-1 > Biba ++ Ber; 


equality holds if and only if a, = b1, az = bo,..., Qx_1 = bx-1. 
Then from the rule for multiplication by a positive number 
(Theorem 2.3), when we multiply the inequality (2.6) by a, we obtain 


(2.7) (a2 +++ Ax-1)ax > (bbe +++ by-1)ax; 
the sign of equality holds if and only if 
QyQ2 +++ Ay_y = bybz +++ by_1. 
Also, from the same rule, when we multiply the inequality 
an > by 
by bybe +++ by_1, we get 
(2.8) (byb2 +++ bx-s)ax > (bibs +++ be-1)bk 


the sign of equality holds if and only if a, = b,. It now follows 
from (2.7) and (2.8), by the transitivity rule, that 


GQ °*+ Ax_1Ay > bybg +++ by_rbe; 


equality holds if and only if a, = 51, a2 = bo, ..., ax = bx. 
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2.7 Division 


THEOREM 2.6. If a>b>0 and c>d>0, then a/d> b/c. In 
particular, fora=b=1, ife>d>0, then 1/d>\/c. 

More generally, if a>b>0 and c>d>0, then a/d> b/c, 
with a/d = b/c if and only if a=b and c=d. 


Note that a is divided by d, and b by c—not a by c, or b by d. 

Thus, by division, the inequalities 7 > 6 and 5 > 3 yield 
7/3 > 6/5; but the inequality 7/5 > 6/3 is false. 

The two given inequalities also yield 1/6 > 1/7 and 1/3 > 1/5. 


Proor. We have 


The denominator cde P by Axiom II, since ce P and de P. Also, 
since a > b and c > d, it follows from Theorem 2.5 that ac > bd; 
hence the numerator ac — bde P or € O, and actually ac — bde P 
unless a = 6 and c = d. By Theorem I.1, the product of a nega- 
tive number and a positive number is negative; but the product 


cd ‘ac — 4) 


is equal to the nonnegative number ac — bd, and cd is positive. 
Hence 


ac—bd_a_b 
cd dic 


is nonnegative. Thus a/d > b/c, with a/d = b/c if and only if 
a=bandc=d. 


Exercises 


1. From the inequality 


show that 
2 
(vay + (7b) < Vb 


for all positive a, b. Under what circumstances does the sign of equality 
hold? 
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2. Show that the sum of a positive number and its reciprocal is at least 2; 
that is, show that 


a+ i >2 
for all positive values a. For what value of a does the sign of equality hold? 
3. Show that 
a+6h2+¢c2 >ab+be+ca 
for all a, b,c. 


4. Show that 
(a? ~ b2\(a4 — bt) < (a — b3)2 
and 
(a2 + b?)(a* + b4) > (a3 + 53/2 
for all a, b. 
5. Show that 


ab + b?c + c2a + ab? + be? + ca? > babe 
for all nonnegative a, b, c. 
6. Show that 
(a? — b?)? > (a — byt 
for all a, b satisfying ab > 0 and that 
(a? — b?)? < (a —b)* 
for all a, b satisfying ab < 0. 
7. Show that 
a3 + b3 > a2b + ab? 
for all a, 5 satisfyinga +b >0. 


8. For any of Exercises 3 through 7 that you have worked, determine under 
what circumstances the signs of equality hold. 


2.8 Powers and Roots 


THEOREM 2.7. If a>b>0, ifm and nare positive integers, and 
if al” and b1/” denote positive nth roots, then 


amin > bm and RS ae mn, 


More generally, if a> b> 0, ifm is a nonnegative integer and n 
§ jg gat g 
a positive integer, and if a\/" and b\/™ denote positive nth roots, then 


(2.9) aqnin > bm/n and bomn > qnmn 


with a™/” = b™/n and b-™/ = a-™/” if and only if either (i) a= b 
or (ii) m=0. 
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For a= 9 and b=4, certain values of a%/, b™/™, b-m/n, q-m/n 

appear in Table 2. For each positive value of m/n you.will note that 
Qm/n > qm/n 5 while 4-™/" > Q-m/n | 


TABLE 2. Samples of Powers of Numbers 


Proor. If m = 0, then a/" = 6™/" = b-m/n = g-m/n = |, so that 
the sign of equality holds in (2.9) in this case. 

If m0, then a™ > b™ by the rule for multiplication of inequali- 
ties (Theorem 2.5); the sign of equality holds if and only if a = b. If 
it were true that a1/" < b1/", then it would also be true that 
(al/n)n < (b1/")", or a <b; but, by hypothesis, a > b. Accordingly, 
ai/n > bln, Therefore a”/" > b™/", with am/" = b™/™ if and only 
ifa=b. 

For negative exponents, let 


qm/n = c, bm/n = da. 
Then 


’ 


= l = l 
m/n — 2 TUN mo 
a Pi b ae k: 


Since we have just shown that 
c>d, 
it follows from Theorem 2.6 that 
1.1] 
-_> = 
d~c 
that is, 
b-m/n > qem/n | 


the sign of equality holding if and only if c = d, that is,a = b. 
The rule can be extended to positive and negative irrational 
powers. 
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Exercises 


1. Show that 
a+b ¢ + ey 2 
a Oe ee 


for all a, b. Under what circumstance does the sign of equality hold? 
2. Show that if a, 6, and c, d are positive (and c and d are rational), then 
(ac — beat — bt) > 0 
and 
actd 4 betd > arht + athe, 
Under what circumstance do the signs of equality hold? 
3. To what does the second inequality in Exercise 2 reduce in the case 
c=d= 1? in the casec=d= 4? 
4. For bd > 0, show that a/b < c/d if and only if ad < bc, and that the 
sign of equality holds in each place if and only if it does in the other. 
5. Show that if a/b < c/d, then 
a + b <¢ + qe 
and that the sign of equality holds if and only if ad = be. 
6. Show that if a/b < c/d, with a, b, c, d positive, then 


a ¢ 
Faah aa gga Se 
atb-c+d 


and that the sign of equality holds if and only if ad = bc. 
7, Show that if a/b < c/d. with 6 and d positive, then 
BS bea Sa 
and that the sign of equality holds if and only if ad = be. 

8. Verify. by the test given in Exercise 4, that the four inequalities in the 
conclusion of Exercises 5, 6, and 7 are valid for the values a = 2,b = 3, 
c=S.d=6. 

9. Write the “<" inequality rule equivalent to the first paragraph of the 
“>” inequality rule for transitivity, addition, multiplication by a number, 
subtraction, multiplication, division, and powers and roots. 


CHAPTER THREE 


Absolute Value 


3.1 Introduction 


In Chapter 1, as you will recall, the inequality a > b was defined 
in terms of the set P of positive numbers. You may also recall that 
for the validity of several of the results of Chapter 2, such as Theo- 
rem 2.5 concerning the multiplication of inequalities, it was necessary 
to specify that certain of the numbers involved should be positive. 
Again, in many instances the fractional powers of numbers that 
appear in Theorem 2.7 would not even be real if the numbers them- 
selves were negative; consider, for instance, a!/? with a= —9. Many 
of the fundamental inequalities, which will be derived in Chapter 4, 
involve just such fractional powers of numbers. It is natural, then, 
that we should often restrict our attention to positive numbers or to 
nonnegative numbers (positive numbers and zero) in this study. 

In applied problems involving inequalities, we often deal with 
weights, volumes, etc., and with the magnitudes, or absolute values, 
of certain mathematical objects such as real numbers, complex num- 
bers, vectors. The magnitudes of all these are measured by nonnega- 
tive numbers. Thus, even though you may choose to denote gains by 
positive numbers and losses by negative numbers, a loss of $3 is still 
a loss of greater magnitude than a loss of $2; the absolute value of 
—3 is greater than the absolute value of —2. 
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In this chapter we shall define and study some properties of the 
absolute value of real numbers. for application to inequalities in sub- 
sequent chapters. We shall also exhibit the graphs of some interest- 
ing but rather “off-beat” functions involving absolute values and 
shall present some new ideas regarding them. 


3.2 Definition 


The absolute value of the real number a is denoted by |a{; it can 
be defined in a variety of equivalent ways. We shall consider several 
of the definitions here. 


DEFINITION. The absolute value |a| of the real number a is defined 
to be a if a is positive or zero, and to be —a if a is negative. 


Thus. |2/=2. [O}=0. and |—2|= —(-—2)=2. 

The principal disadvantage of the foregoing definition is that it is 
unsuitable for algebraic manipulation. Thus (see Theorem 3.2 later 
in this chapter) for all a, b it is true that 


la+ b|<|al+|], 


as you can verify by considering separately the cases in which a and 
b are both positive. one positive and the other negative, both nega- 
tive. one zero and one positive. one zero and one negative, and both 
zero. But it would be preferable to give a unified proof of such 
a result by standard algebraic procedures; this will be done in Sec. 
3.8, after a different but equivalent definition of absolute value in 
terms of squares and square roots will have been given. 
We could rephrase the above definition somewhat differently: 


The absolute value |a| of the real number a is 0 if ae O, and other- 
wise |a| is the positive member of the set {a, —a}. 


Thus. if @ = 2, then |a| is the positive member of {2, —2}.1e., 
2: ifa = —2, then |a| is the positive member of {—2, —(—2)}. Le.. 
2. But this characterization of |a| has the same algebraic disadvan- 
tages as the preceding one. 


3.3 Special Symbols 


The next two characterizations of {a| depend on two useful spe- 
cial symbols. max { } and { }*, which we shall now define. 
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For any set {ay, a2, . .., Gn} of real numbers, the symbol 
max {41, G2, ..., Qn} denotes the greatest member of the set. 


If there are only one or two members in the set, we still say 
“greatest” in this connection; and if the greatest value is taken on by 
more than one of the members, then any one of these is understood 
to be the greatest. Thus, 


max (3,7,0,—2,5} = 7, max {4,4} =4, max {—-3,—-l}=-—1l. 


With some difficulty, arithmetic operations can be performed on 
expressions involving the symbol max { }; for example, 


(max {4, —3})(max {0, 5}) + max {—4,4} — max {—9, —8} _, 
2 max {1,4} ces 


In particular, consider max {a, —a}; ifa = 2, then 

max {a, —a} = max {2, —2} = 2 =|a}; 

ifa = —3, then 
max {a, —a} = max { —3, —(—3)} = 3 =|a|; 

if a = 0, then 

max {a, —a} = max {0,0} = 0 =|{a|: 
and so on. Thus, for all a, 
(3.1) max {a, —a} =|al, 


so that (3.1) gives us another characterization of |a\. 
Let us turn now to the consideration of a second special symbol. 
The symbol {a,, a2,...,@n}* denotes the greatest member of the set 
{ay, d2,..., Qn} if there is at least one nonnegative member of the set; 
but if all the members of the set are negative then the symbol denotes 0. 
Thus 


{3,7,0,-2,5}*=7, {4,4}*=4, {(-3,-]1}*=0. 


As in the case of max { }, it is awkward but possible to deal 
arithmetically with expressions involving the symbol { }+*; for 
example, 


({4, —3}*{0, 5}*) + {—4,4}* — (-9, —8}* _ 3 
2{1,4}* cas 
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The symbols max { } and { }+ are not equivalent, as the above 
examples show: in fact, you can easily see from the definitions that 


{ay, d2,..., ay}* = max {0, a1, @2...., an} 
= max | 0, max {a1.42,....4n}! ; 
Thus it follows that 
{1,42,...,Qn}* > max {a).a2,..., an}, 

where the sign of equality holds if and only if there is at least one 
nonnegative member of the set (a1, ag,....dn}. 

Accordingly, since the special set {a, —a} does have a nonnega- 
tive member for all a, 


{a, —a}* = max {a, —a} =|a|. 


Thus, the equation 


(a, —a}* = |al 


might also be considered as a definition of \a|. 


Exercises 
1. Determine the values of 
(a) max {—7, —4, —1}. (f) {-7.-4-1}t. 
(b) max (3,7. /2}. (g) {3.. /2}*. 
(c) max {—7,0, —1}, (h) {—7,0, —1}*, 
(d) max (0, 4,1}, (i) {0,4, 1}*. 
(e) max {3, —3, 3), (Gj) {3, —3, 3}*. 

2. For any set {a1,a2...., a,} of real values, the symbol min {a}. a2,...,4n} 
denotes the least member of the set {a;,a2,.... a,}, and the symbol 
{@1,d2...., a,}~ denotes the least member of the set {0, a1, @2,...,@n}. 
Determine the values of 

(a) min{—7,—-—4,-1}. (ff) {-7.-4,-1}-, 
(b) min (3,7, 2}, (g) {3.7 V/2}>. 

(c) min {—7,0, —1}, (h) {—7.0, —1}-, 
(d) min (0, 4.1}. (i) {0,4, B}-, 


(e) min {3, —3, 3}, Gj) (3, ~3,3}-. 
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3. Determine the value of 
(max { —1,-2})({ —1, —2}*) — (min (1, 23)({1.2}7). 
4. Show that 
max { max {a, b,c}, max {d, e}| = max {a. b,c, d. e}. 
5. Give an example showing that the inequality 
max {a,b} + max {c,d} > max {a, 6, c,d} 


is not always valid. 


6. Show that 
{a, b}+ + {c,d}* > {a,b,c,d}*. 
7. Show that 
{ay, d2,...,@n}* > max (a), @2,...,4n} 
> min {a1, a2,..., an} 
> {@1,a2..... an} 
Is there any set for which the strict inequality sign holds in all three 
places? 
8. Show that if a = max {a, b,c}, then ~a = min {—a, —), —c}. 
9. Show that {—a, —b}- = —{a, b}*. 


10. Show that max {a;, @2,...,@n} = max 4a), Max {a2, a3,... say)} 


3.4 Graphical Considerations 


A graphical representation can furnish a vivid and striking picture 
of the behavior of a function, whether we are dealing with mean 
daily temperatures, the fluctuations of the stock market, |x|, or 
what-not; for one thing, a graph lets us see at a glance some of the 
over-all properties of the function that otherwise might have been 
obscure. 

For example, the symbols max { } and { }* are made more 
meaningful by a consideration of the graphs of 


| 1 3 7 
2 2’ 3x-4| 
and 
sel ies: corgi hale 
y= a" 2’ 4* ay 


which are shown in Figs. 3.1 and 3.2, respectively. In these figures, 
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the graphs of 
Silas L Ake: aoe 
PRX = 5 and y= x 


are extended as dashed lines. 


Figure 3.1. 


3 7 | 
Grd y= me me aio: Se? See eee 
raph of DJ Maxi) X . x I 


Let us now construct the graph of the function defined by the 
equation y = |x|. This graph provides us with a visual characteriza- 
tion of absolute value. \t will be sufficient for our purposes to restrict 
our attention to the incomplete graph corresponding to the interval 
-3<x<3. 

In the construction of this graph, it is helpful and interesting to 
consider first the graph of »’ = x, that is, the graph of the set 
of ordered pairs of real numbers (x, »’) such that y’ = x, and also 
the graph of y’” = —x, as shown in Figs. 3.3 and 3.4, respectively. 


Figure 3.3. Figure 3.4. 
Graph of y= x, -3 <x <3 Graph of y” = -—x, -3 <x <3 
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From these figures and the definition 
|x| = max {x, —x} = max {y’, y’}, 
you can see at once that the graph of y = |x| is simply the graph of 


y = max {y’, y’”}, as illustrated in Fig. 3.5. Thus for each abscissa x, 
the greater of the ordinates y’ and y” was chosen from Figs. 3.3 and 


3.4 as the ordinate y in Fig. 3.5. For example, when x = —2, the 
greater ordinate is y’ = 2; when x = 1, the greater ordinate is 
y = 1, ete. 

Figure 3.6 shows the graph of y = — |x|. 


Figure 3.5. Figure 3.6. 
Graph of y={xj, -3 <x <3 Graph of y= —|x|, -3<x <3 


Looking at the four graphs in Figs. 3.3 through 3.6, you will note 
that for no value of the abscissa x is any one of the corresponding 
ordinates less than — |x| or more than |x|. Explicitly, from Figs. 3.3, 
3.5, and 3.6, you can read off the following result, which of course 
you might have discovered and proved without considering the 
graphs at all: 


THEOREM 3.1. For each real number a, 
—|a|<a<fal. 
The first sign of equality holds if and only if a <0, and the second if 
and only if a>0. 


Theorem 3.1 follows, for instance, from the fact that a = —|a| if 
aeN or aeO, and a=|a| if ae P or ae QO, and from the fact 
(see Exercise 8 in Chapter 1) that any positive number is greater 
than any negative number. 


32 AN INTRODUCTION TO INEQUALITIES 

Now, for exercises, let us consider the graphs of some more com- 
plicated functions involving absolute values. 

Consider first the graph of 


y =F +1). 
For x > 0, |x|= x, and hence 

yolotx=x; 

2 

but for x <0, |x| = —x, so that 

yo F(x —x)=0. 
This graph, which is shown in Fig. 3.7, could easily have been pic- 
tured from a consideration of Figs. 3.3 and 3.5 by taking the average 
of the ordinates y’, y for each abscissa x. 


You might observe that the graph shown in Fig. 3.7 is also the 
graph of y = max {0, x}, as well as the graph of y = {x}+. Thus, 


{x}* = max {0,x} = Fox + |[x|) 


for all x. 
Now let us look at the graph of 
(3.2) yoxt ll tix] +]x—1|-3 


in the interval —2 < x < 2. For 1 < x, the terms in the right-hand 
member of (3.2) can be written thus: 


2|x +1) =2x + 2, |x{= x, Ix-I]=x-1, -3=-3, 
so that for 1 < x we have 
ypolxt+24+xu+x-1-3= 4-2. 


For 0 < x < 1, the first two members in the right-hand member of 
(3.2) can be written as before, but 


Jjx-I]J=1l-~x, not jx—-fj=x-]. 
(Can you explain why?) Accordingly, forO< x <1, 
ypodxt24+x+]l—x-3=2x, 
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Similarly, for —1 << x <0, 


ypo2xt+2-—-x+1—x-3=0; 
and forx < —1, 
yo —2x-2-x4+1-—x-—3= —4x-4. 


Figure 3.7. Figure 3.8. Graph of 
Graph of y = 4(x + |x|), -3 $x <3 y=2xt Vt [x] +ix— 1] -3, 
—2<¢x¢2 


Thus eq. (3.2) is equivalent to a different linear equation in each of 
the foregoing intervals. Plotting the corresponding linear segments in 
the appropriate intervals, we obtain the continuous graph shown in 
Fig. 3.8. 


Exercises 


1. From a consideration of Figs. 3.4, 3.5, 3.6, obtain a result for —a analog- 
ous to Theorem 3.1 for a. 


2. For —3 < x < 3, sketch the graphs of 
(@) y=F@-IxD 0) y=hUxI- 9, © y= FO + Ix) 


3. Determine which of the graphs in Exercise 2 is also the graph of 


(d) y = min (0, x}, (g) y = {x}, 
(e) y = max {0, —x}, (h) y = {~x}*, 
(f) y = min (0, -x}, (i) y= {=x}. 
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4. For —3 < x <3. sketch the graph of 
(a) iy = max {x —x - 24, (c) p= {x, —v—2)}*, 
(by y= min {x —v — 23, (dq) vs {x =x —2)}-. 
5. For --3 <x < 3. sketch the graph of 
vw dix -bp—lvp+ 2px +t 1)—5. 

6. A function / defined by an equation y= f(x) is said to be even if 
J(—x) = f(x) for all x, and is said to be odd if {(-—x) = —f¢x) for all x. 
Thus, the function defined by ry = x? is even since (—.v)? = x2, and the 
function defined by y = x3 is odd since (—.x)? = —.x3: of course, some 
functions are neither even nor odd. Which of the functions illustrated in 
Figs. 3.3 through 3.6 are even and which are odd? 


3.5 The “Sign” Function 


Another function closely related to [x] is illustrated in Fig. 3.9. 
It is the function denoted by y = sgn x (read * ‘sign of x.” but not to 
be confused with sin x). and defined by the equations 


sgnx= +1 for x >0. 


(3.3) sgn.v = 0 for x =0, 
sgnx = —1 for x <0. 
y 
‘ 
2 
(a 
Cut of eu 


Figure 3.9. Graph of vy = sgnx, -3 ox <3 


In the graph. the points corresponding to (0, 1) and (0, — 1) are en- 
closed in unshaded, or open, circles to emphasize the fact that they 
are not included in the graph, and the point corresponding to (0, 0) 
is covered by a shaded circle to emphasize that it /s included. 
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The function y = sgn x is related to y = |x| through the notion of 
slope, which is defined as follows: 

Let L be a nonvertical line in the coordinate plane, and let 
P1:(X1,y1) and P2:(X2, y2) be distinct points on L; see Figs. 3.10 and 
3.11. In going from P; to Pe, the vertical rise is the directed distance 
y2 — yi, and the horizontal run is the directed distance xz — x1 #0. 
Of course, either the rise or the run or both might be negative; thus 
in Fig. 3.11, the rise ye — y1 is negative (actually, then, a fall). The 
ratio of the rise to the run, which has the same value for all pairs of 
distinct points P;, Pz on L, is defined to be the slope m of L: 


Pilxysy) 
Pela yy) 


Figure 3.10. Figure 3.11. 
A line L having positive slope A line L having negative slope 


It should be easy for you to verify that the slopes of the linear 
graphs of y = x and y = —x are 1 and —1, respectively. See Figs. 
3.3 and 3.4. 

Now consider the slope of the graph of y =|x|, shown in Fig. 
3.5; at the same time, keep in mind the ordinate of the graph of 
y = sgn x shown in Fig. 3.9. ; 

For x > 0, the graph of y = |x| coincides with the linear graph of 
y =x, and it has slope m= 1. Also, for x >0, the graph of 
y = sgnx has ordinate y = 1. 

For x < 0, the graph of y =|x| coincides with the linear graph 
of y = —x, and it has slope m = —1. Also, for x <0, the graph of 
y = sgn x has ordinate y = —1. 

For x = 0, the slope of the graph of y =|x| is undefined, but you 
can say that the right-hand slope at (0,0) is | and the left-hand slope 
at that point is — 1. The average of these slopes is 4{1 + (—1])} = 0. 
Also, for x = 0, the graph of y = sgn x has ordinate y = 0. 
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Thus, y =|x| and y = sgn x are related geometrically as follows: 

For x #0, the value of y = sgn x is equal to the value of the slope 
of the graph y = |x|; and for x = 0, it is equal to the average of the 
values of the right-hand and left-hand slopes. 

It is interesting to note that although we have been analyzing two 
relatively simple functions, y =|x| and y = sgnx, the graph of 
y =|x| is somewhat peculiar in that it does not have a continuous 
slope; and the graph of y = sgn x is even more strange in that it is, 
itself, discontinuous. We shall not here attempt to define continuity 
and discontinuity, but the meaning should be intuitively clear in the 
present instances. 

The function defined by y = sgn x is related to y =|x| ina sec- 
ond instructive way. A moment’s reflection shows that, for each 
real number a, 


asgna=|a|; 


accordingly, this equation gives us another characterization of the 
absolute value |a| of a number a. 


Exercises 


1]. For —3 <x <3, draw lines passing through (0,0) and having slopes 
(a) m= 0, (b) m= 34, (Cc) m= —-]. 
2. Sketch the portions of the graphs of 


(a) y=x, (bt) yox+l, (C) y=xte2, 
(d) y=xt+3, (0-) y=x-l, (ff) y=x-2, 
that lie in the square -Il <x <1, —1 <y <I. It is either impossi- 


ble or extremely easy to do part (d); which? why? 
3. For —3 <x <3, sketch the graphs of 
(a) y = (x + I)sgnx, (b) y = xsgn(x + 1). 


4, For -3 <x < 3, sketch an (x, y) graph such that, for each x, the value 
of y is the slope of the graph in Fig. 3.6; for any x where there is no defi- 
nite slope, use the average of the right-hand slope and the left-hand slope. 


3.6 Graphs of Inequalities 


Before we leave the visually instructive subject of graphs, let us 
investigate a few inequalities involving absolute values. 
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Consider, for example, the equation 


jx{=1 
and the inequality 
Ix|<1. 
The equation has just two solutions, namely 
x=1l and#$x=-—1; 


but the inequality has the entire interval 


-l<x<l 

as solution. 
Again, the equation 

|x — 1} =2 

has just the two solutions 
x=-! and x=3; 

but the inequality 

|x —1]<2 
has, as solutions, each value x satisfying 

—-l<x<¢3. 


See Fig. 3.12. 


Figure 3,12. Graph of the inequality |x — 1|< 2 


To find the solution of the inequality 
(3.4) Ixl+ isl, 


we consider values in each of the quadrants of the (x, y)-plane sepa- 
rately. Thus in the first quadrant, where 


x>0 and y>O, 
the inequality (3.4) is equivalent to 
x+y<l. 
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We draw the portion of the line 


v+yel, 

i.e., of the line 
yol-—x, 

that lies in the first quadrant; and since we seek the solution of 
yesl—-x, 


our graph consists of those points in this quadrant that lie either on 
or below the line. 

The graph is shown in Fig. 3.13, while the entire graph of the 
inequality (3.4) appears in Fig. 3.14. 


Figure 3.13. Figure 3.14. 
Graph of [xf+ir[<l. x20. y>0 Graph of Jxj+ rl <i 


Figure 3.13 might be considered also as showing the graph of the 
solution of the set of inequalities 


x+ty<cil 
(3.5) x>0 
y>o0. 


In Fig. 3.15 the shaded regions in (a), (b), and (c) comprise the solu- 
tions, respectively, of the first, second, and third inequality in (3.5). 
the triply shaded region in (d) is the simultaneous solution of all the 
inequalities (3.5). Thus, though the set of equations 


xXx+V>= ] 
0 
0 


x 
¥ 
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is satisfied by no pair of values (x, y), the set of inequalities (3.5) has 
an entire area of solutions, ; 

You will recall that, somewhat similarly, the equation [|x| = 1 has 
only two solutions while the inequality |x| < 1 has an entire interval 
of solutions. Thus the set of all solutions of an inequality (or set of 
inequalities) is often much “richer” than the set of all solutions of 
the corresponding equation (or set of equations). 


Exercises 


1, In Fig. 3.15(d), the plane is separated into seven differently shaded por- 
tions. Each, together with its boundary, constitutes the solution of a set 
of three inequalities; for example, one set is x <0,y>0,x+y>1. 


(c) 


(¢) 
Figure 3.15. Incomplete graph of (a)x +y <1, (b)x >0, and (c)y > 0; triply 
shaded graph (d) of the intersection of (a), (b), and (c) 


Give sets of inequalities for each of the areas. 
2. Sketch the graphs of 


(a) |x|—ly|>1 for -2<x<2, (b) |x} 42IyI<l. 
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3. Sketch the graph of the system 
yoxt3 
—2<x<2 
y>0. 
4. Sketch the graph of the system 
ax+t+y sh 
x—-ysl 
x+2y <7. 
5. Sketch the graph of the system 
ax+y>5 
x-r>1 
x+2y >7. 


3.7 Algebraic Characterization 


Our next, and last, characterization of |a| perhaps seems, at first 
glance, to be the /east desirable as a definition, for it appears to be 
devious and unnatural; but it has the virtue of being algebraically 
the most tractable, and accordingly it is the one we shall most often 
use. 

To this end, consider the following: If a = —2, then 

a@=(-22=4, V4=2={al, so that Vat =|al; 
if a= 0, then 

a=0=0, ~0=0=|al, so that Vat = lal; 
and if a = 2, then 
a@=2=4, V4=2={al, so that Va? =|al. 
Similarly for all real a, 


va = jal, 


and this is our algebraic characterization of \a\. 
Actually the characterization |a| = \/a? expresses the special case 
b = 0 of the Pythagorean relationship, 


c= Va + B, 
between the lengths a, b of the sides and the length c of the hypote- 
nuse of a right triangle. Thus the absolute value of the real number 


a can be interpreted as the length, or magnitude, of the line segment 
from the origin to the point representing a on the number scale 
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(Fig. 1.1). The reader who continues to study mathematics will soon 
learn that this last characterization of absolute value (and the accom- 
panying geometric concept of length) can be appropriately general- 
ized so that absolute value may be defined also for other mathematical 
objects, e.g., vectors and complex numbers. 

Two points to note in the foregoing algebraic expression of |a| 
are, first, that a? is nonnegative, so that its square roots are real, and 
secondly, that by definition the symbol \/ means the nonnegative 
square root. To be sure, (+2)? = 4, so that 4 has two square roots— 
namely +2; but in algebraic manipulations the symbol \/4 denotes 
only 2, not —2. For example, consider the equations 


5+V4=7, 5- V4=3, 
5~— /4=7, 5+ /4=3. 


The first two of these equations we consider to be valid, and the sec- 
ond two invalid, but only because the symbol \/ always means the 
nonnegative square root. It is for this reason, also, that + appears 
before the symbol /_ in the familiar formula for the solutions of 
the quadratic equation 


ax? +bx+cec=0, a0, 


and 


namely ; — 
_ -~bt Vb — 4ac 
x= oa : 
Exercises 


1. You can see from the Pythagorean relationship that, for the equation 
x2 + y? = 2 ; 
the solution set (locus) of points (x, y) constitutes the circle with center at 
the origin and radius r. Determine the solution set of values (x, y) for the 
inequality 
x2 + y? ¢ 25. 
2. The absolute value of the complex number x + éy is defined by 
Ix + iy] = Vx? +7. 
Representing x + iy by the point having coordinates (x, y) in the plane, 
’ determine the solution set for 


I<|x+H]$2. 
3. Determine the locus of points x + éy for which 
|x + iy + Lf =|[x + iy — 1. 
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3.8 The “Triangle” Inequality 


The inequality (3.6) in Theorem 3.2 below, to which we have 
referred earlier in this chapter, is often called the “triangle” inequal- 
ity for geometric reasons that will be discussed further in Chapter 4. 
Here is a complete statement concerning the inequality: 


THEOREM 3.2. For all real numbers a and b, 
(3.6) jaj+jb[>la+ dl]. 


The sign of equality holds if and only if ab >. that is, if and only if 
aand b are either both >0 or both <0. 


For example, if a = 5 and 6 = —2, then 


(aba seanle% 
while 
ja} + |b] = |5|+}—2|= 7. 


But ifa = —5 and 6b = —2, then 


ja + b[ =|(—5) + (—2)[=7 
and 
jaj+fb|={|—-5]+]—2|=7. 


You can grasp the validity of the inequality (3.6) intuitively by 
noting that if a and 4 are of opposite sign, then on the right-hand 
side of the inequality the numbers a and b will “work against each 
other” and emerge from the encounter with diminished joint magni- 
tude, whereas on the left-hand side they are forced to “pull together” 
positively from the start. 

On the same basis. you can understand the inequality 


(3.7) ja — bl > |ja}—loI]. 


in which the sign of equality holds again if and only if ab > 0 (ie., 
a and & are both >0 or both <0). Here, on the right-hand side of 
the inequality, the numbers a and b are always forced to “settle their 
difference,” whereas on the left-hand side, if a and b are of opposite 
sign, then they will pull together to increase their joint magnitude. 

Inequalities (3.6) and (3.7) can be proved by using the algebraic 
definition |a@| = \/a?. Thus inequality (3.6) can be restated equiva- 
lently as 


(3.8) Va + VB > Ma +b. 
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Now, (3.8) is equivalent to 


(3.9) (Va? + /B?)? > (a + bY; 


that is, the validity of either (3.8) or (3.9) implies that of the other. 
Thus (3.9) follows from (3.8) by squaring (see Theorem 2.5), and (3.8) 
follows from (3.9) by taking nonnegative square roots (see Theo- 
rem 2.7). 

Next, (3.9) can be rewritten as 


a? + 2\/a®b? + b2 > a? + 2ab+ b?. 
Accordingly, (3.9) is equivalent to 
(3.10) Va?b? > ab 


by the inequality rules for addition, subtraction, and multiplication 
by a positive number. But 


Vaeb? = \/(aby* = |ab|, 
so that (3.10) is equivalent to 
(3.11) |ab| > ab. 
Thus (3.6) is equivalent to (3.11). 

Now, (3.11) is valid by Theorem 3.1, which states that any real 
number is less than or equal to its absolute value. The sign of equality 
in (3.11) holds if and only if ab > 0. Therefore the inequality (3.6), 
which is equivalent to (3.11), is also valid, and the sign of equality 
holds if and only if ab > 0. 


The inequality (3.7) can be proved similarly after it has been written 
in the equivalent form 


VG= BF > la = VBP. 
However, it is interesting to note that (3.7) can also be derived 
directly from (3.6). Thus, substitution of the real number a — 6 for 
the arbitrary real number a in (3.6) yields 


la — bl +|b|>|a—b + by, 


or 

ja—b|+ jb|>|Ial, 
whence 
(3.12) ja — b|>|al— |b] 


by Theorem 2.4, the subtraction rule. Similarly, substitution of b — a 
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for b in (3.6) yields 


laj+|b-—a|>la+b—al, 


or 
ja|+|a— b|>|d|, 

whence 

(3.13) ja — b| > |b|—|a. 

Since 


||a] —|b\| = max {(|a] —|6}), — (Ja| —|b])} 
= max {(/a] —|5]), (16| —|a})}. 


it follows from (3.12) and (3.13) that 
Ja — b{> |[a|—|d| 


Hence (3.7) is valid, the sign of equality holding if and only if it holds 
either in (3.12) or in (3.13), 

Since a — b was substituted for a in deriving (3.12), the sign of 
equality holds in (3.12) if and only if (@ — b)b > 0, ie., ab > b?. The 
latter inequality is valid if and only if ab > 0 and |a| > |d|. Simi- 
larly, the sign of equality holds in (3.13) if and only if a(b — a) > 0, 
that is, ab > a?. This is valid if and only if ab > 0 and |b| > [a]. 
Since at least one of the inequalities |a| >|b| and |b| > |a| holds, 
it follows that the sign of equality holds in (3.7) if and only if ab > 0. 

It might be noted that since b represents an arbitrary real number— 
positive, zero, or negative—the inequalities (3.6) and (3.7) still hold 
if b is replaced by —b. We thus obtain 


(3.14) ja|+|b|> a — d| 

from (3.6), and 

(3.15) |a + b}> ||a|—|6|| 

from (3.7), with the signs of equality holding if and only if 
a(—b) > 0, that is, ab < 0. Together, inequalities (3.6), (3.7), (3.14), 
and (3.15) can be written as 


(3.16) ja|+ |b] >|ax b|> |Ja|—|d}|. 
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Exercises 
. From the algebraic characterization of absolute value, show for all real 
a, b that 
(a) |-a\=fa], (6) [ab] = a}-|d|, 
and if b 0 that 


a 


5 


=a. 
{5| 
. Determine whether it is the sign < or the sign = that holds in the mixed 
inequality |a — b| > |\a|—j||, if 
(a) a=7, b=29; (dd) a=-9, b= —-—10; 
(bt) a=—n, b= V2; (e-) a=9, b= —-10. 
(c) a=2, b=0; 
. Determine whether it is the sign > or the sign = that holds in the mixed 
inequality Ja|+|b|>|a + d|, if 
(a) a=3, b= +2; (d) a=0, b= -2; 
(bF) a=-—3, b=-2; (e<) a=0, b=0. 
(c) a=3, b=2; 
. Repeat Exercise 2 for the inequality |a + b| > |\a| -—|d| | ; 
. Repeat Exercise 3 for the inequality |a|+ |b|> Ja — b|. 
. Show that the inequality |a — b| > ||a|—|b{| is equivalent to the in- 
equality |ab| > ab. 
. Show that if ab > 0 then ab > min {a?, b?}. 


. Show that each of the other characteristic properties of |a|, given in this 
chapter, follows from |a| = \/a?. 


(©) 


CHAPTER FOUR 


The Classical Inequalities 


4.1 Introduction 


Now that we have forged our basic tools, we shall demonstrate more 
of the magic of mathematics. As an artist evokes, out of a few lines 
on a canvas, scenes of great beauty, and as a musician conjures up 
exquisite melodies from combinations of a few notes, so the mathe- 
matician with a few penetrating logical steps portrays results of 
simple elegance. Often, like the product of the magician’s wand, these 
results seem quite mysterious, despite their simplicity, until their 
origin is perceived. 

In this chapter, we shall employ the basic results derived in the 
previous chapters to obtain some of the most famous inequalities in 
the field of mathematical analysis. These inequalities are the every- 
day working tools of the specialist in this branch of mathematics. 

In Chapter 5, we shall then show how these new relationships may 
be used to solve a number of interesting problems that, at first sight, 
seem far removed from algebra and inequalities. The applications are 
continued in Chapter 6, where we discuss and extend the notion of 
distance. 

This, indeed, is one of the fascinations of mathematics—that simple 
ideas applied one after the other, in the proper order, yield results 
that never could have been envisaged at the outset. 


47 
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4.2 The Inequality of the Arithmetic and Geometric Means 


(a) Mathematical Experimentation. Given two nonnegative num- 
bers, say | and 2, let us obtain their “mean” in the following two ways: 
the arithmetic mean (or half their sum), usually called “average,” 


om 
at = 1.5, 


and the geometric mean (or square root of their product ) 


Vl-2= 141--- 


Observe that 1.5 > 1.41---. Similarly, if we start with the numbers 
3 and 9, for the arithmetic mean we obtain the value $(3 + 9) = 6, 
and for the geometric mean we get the value \/27 = 5.19---. Note 
that 6 > 5.19---. Continuing with various pairs of nonnegative 
numbers chosen at random, say I and 13, $ and 4, and so on, we 
observe in each case that the arithmetic mean is greater than the 
geometric mean. 

Can we safely generalize this discovery and reach some conclusions? 
Our mathematical nose begins to twitch as we scent a theorem. 
Maybe the result is true for all pairs of nonnegative numbers! In 
other words, we conjecture that the arithmetic mean of two nonnega- 
tive numbers is always at least as great as their geometric mean. We 
shall express this conjecture in terms of algebraic symbols and we 
shall see in subsection (b) that our i ala is true. We may there- 
fore state it as 


THEOREM 4.1. For any nonnegative numbers a and b, 
(4.1) 40> ab. 


The sign of equality holds if and only if a = b. 


Note that if one of the two numbers were positive and the other 
negative, then (4.1) would be meaningless since its right-hand side 
would be imaginary.} If both numbers were negative, then the left- 
hand side of (4.1) would be negative and the right-hand side positive, 
so that the theorem would not be valid. 


+The notion of inequality is not directly applicable to imaginary numbers, but only 
to their absolute values. 
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The type of experimentation that led us to Theorem 4.1 repre- 
sents the sort of trial-and-error methods often used by mathemati- 
cians on the trail of theorems. Formerly, it was quite laborious work. 
Nowadays, with the modern digital computer to aid mathematical 
experimentation, we can test thousands and millions of cases in a few 
hours. In this way, we obtain valuable clues to general mathematical 
truths. 


Exercises 


1. Determine the geometric mean and the arithmetic mean for the following 
pairs of numbers: 


(a) 2,8, (b) 3,12, (c) 4,9,  (d) 0,20. 


2. If p is nonnegative, determine the geometric mean and the arithmetic 
mean of the following pairs of numbers: 


(a) p, 9p, (b) Op, (©) _ 2, 2p2. 


(b) Proof of the Arithmetic-mean-Geometric-mean Inequality for 
Two Numbers. Since square roots are a bit bothersome, let us elimi- 
nate them by writing 


(4.2) =c?, b=d?2, 


This is permissible because a and b were assumed, in Theorem 4.1, 
to be nonnegative. The relationship (4.1) that we wish to prove for 
arbitrary nonnegative a, b then becomes 


(4.3) ate > cd 


for arbitrary real c, d. Now (4.3) is true if and only if 
(4.4) C+’ —cd>0, 


which is equivalent to 
(4.5) c? + d? — 2cd > 0 


as a result of our elementary rules for dealing with inequalities. 
We now recognize a familiar friend, namely 


(4.6) c? + d? — 2cd=(c — d)?, 
so that (4.5) is equivalent to 
(4.7) (c—dy>0. 
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Since, by Theorem 1.3, the square of any real number is nonnega- 
tive, we see that (4.7) is indeed true. Thus (4.5) is a valid inequality, 
and hence (4.4), (4.3), and (4.1) are also valid. The sign of equality 
holds in (4.7), and therefore in (4.1), if and only if c — d = 0, that is, 
c = d, or, equivalently, if and only ifa = b. 

Note that, while the inequality (4.1) of Theorem 4.1 applies only 
to nonnegative numbers a, 6, the foregoing proof shows that in- 
equality (4.3) is valid for all real numbers c, d, the sign of equality 
holding if and only if c =d. You will observe that the results of 
Secs. 4.4 and 4.6 also are valid for all real numbers, not merely non- 
negative numbers; this fact increases the geometric significance of 
those results. 


(c) A Geometric Proof. Let us now show that Theorem 4.1 can also 
be obtained geometrically by means of a simple comparison of areas. 
Consider the graph of y = x, as shown in Fig. 4.1. Let S and T be 
points on the line y = x, with coordinates (c, c) and (d, d), respec- 
tively, and consider the points P:(c, 0), Q:(0,d), and R:(c, d), as 
shown. Since OP is of length c, PS has the same length c. Then the 
area of the triangle OPS is c?/2, i.e., one half of the base times the 
altitude. Similarly, the area of the right triangle OQT is d2/2. 


Figure 4.1. Geometric proof of the inequality e + ba > cd 


Now examine the rectangle OPRQ. Its area is completely covered 
by the triangles OPS and OQT, so that 
(4.8) area (OPS) + area (OQT) > area (OPRQ). 


Since the area of OPRQ is cd, length times width, we may write (4.8) 
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in algebraic symbols as 
2 2 

(4.9) cra > ed. 
Now the inequality (4.9) is identical with the inequality (4.3), so that 
our geometric proof is complete. 

Furthermore, we see that there is equality only when the triangle 
TRS has area zero, that is, only when S and T coincide, so that 
c=d. 


(d) A Geometric Generalization. A little thought will show that the 
foregoing arguments remain valid even in cases where the curve OTS 
is not a straight line. Consider the diagram shown in Fig. 4.2. It is 
still true that 


(4.10) area (OPS) + area (OQT) > area (OPRQ). 


0 P ‘ 
Figure 4.2. A more general geometric inequality 
When you have studied calculus and have learned how to evaluate 
the area underneath simple curves, such as y = x*, for arbitrary 
positive a, you will find that this process will yield a number of in- 
teresting inequalities in a very simple fashion. In later sections of this 
chapter, we shall obtain some of these inequalities in a different way. 


Exercises 


1. Let a and b be the lengths of a pair of adjacent segments on a line, and 
draw a semicircle with the two segments together as its diameter, as in 


[+o ->fe—- » | 


Figure 4.3 


Fig. 4.3. Show that the radius r of the circle is the arithmetic mean of a 
and 6, and that the perpendicular distance h is their geometric mean. 
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2. An average that occurs quite naturally in optics and in the study of elec- 
trical networks is the harmonic mean. For two given positive quantities 
a and 6, the quantity c determined by means of the relationship 


a ee 


c ab 
is called the harmonic mean. Solving this equation for c, we obtain 
ee 2 — _2ab 
l/a+ 1/b a+b 
Show that the harmonic mean is less than or equal to the arithmetic mean, 
and also less than or equal to the geometric mean, with equality if and 
only if a = ); i.e., show that 


a+b 2ab 
2 2 Vat 2a 


3. Determine the harmonic, geometric, and arithmetic means of the pairs 
(a) 2,8: (b) 3, 12; (c) 4,9; (d) 5,7; (e) 6,6. 


4. The relationship between distance d, rate r, and time tis d = rt. Show 
that if in traveling from one town to another you go half the distance at 
rate r;, and half at rate re, then your average rate is the harmonic mean 
of r; and rg, but that if you travel half the time at rate r; and half the time at 
rate r2, then your average rate is their arithmetic mean. If r; 4 r2, which 
method would get you there sooner? 


5. Use the result of Theorem 4.1 to solve Exercise 2 on page 22. 


(e) The Arithmetic-mean-Geometric-mean Inequality for Three 
Numbers. Let us now perform some further experimentation. Taking 
three nonnegative numbers, say |, 2, and 4, let us compute their 
arithmetic mean—the simple average—as before: 


1+24+4_433 
5 


Let us also compute their geometric mean, i.e., the cube root of their 
product: 


eee, 


W1-2-4=2. 


We observe that the arithmetic mean of these three numbers is 
greater than their geometric mean. Performing a number of such ex- 
periments with triplets of nonnegative numbers, we constantly observe 
the same result. We begin to suspect that we have found another 
theorem. Can it be true that there is an extension of Theorem 4.1, a 
result asserting that the arithmetic mean of three nonnegative quan- 
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tities is at least as great as their geometric mean? 
We wish to prove 


THEOREM 4.2. For any three nonnegative numbers a, b, ¢, 
(4.11) atb4¢> Yabe. 


The sign of equality holds if and only ifa=b=c. 
To remove the cube roots, let us set 


(4.12) a= x3, b=y3, c= z3. 
Substituting these values for a, 6, and c in (4.11), we obtain 


(4.13) Stee > xyz, 


which is equivalent to 
(4.14) x3 + y3 + 23 — 3xyz > 0. 


We shall prove Theorem 4.2 by proving the validity of (4.14) for 
arbitrary nonnegative x, y, z. 

Once again we have an expression that can be factored. Its factori- 
zation is not as common as the one used before, but is still a quite 
useful one. We assert that 


(4.15) x3 + y3 + 23 — 3xyz 
; = (x + y + zx? + y? + 2? — xy — xz — yz), 


a result that can be verified by multiplication. 

Since x + y + z is nonnegative, the first factor on the right in 
(4.15) is positive unless x = y = z =Q. In order to demonstrate 
(4.14), it is sufficient to show that the second factor also is nonnega- 
tive, i.e., that 


(4.16) x2 + yy? + 22@— xy — xz — yz > 0. 


Referring back to the inequality (x ~— y)? = x? + y? — 2xy >0 
already used in the algebraic proof of the arithmetic-mean-geometric- 
mean inequality for two numbers [see subsection (b)}, we see that the 
inequality (4.16) can be derived from this in the following fashion. 
Write 


(4.17) x? + yy? >2xy, x2 +22 >2xz, yp? +22 > 2yz, 
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and add the three inequalities. The result is 


(4.18) 2(x? + y? + 2?) > 2(xv + xz + yz), 


which is equivalent to the desired inequality (4.16). The sign of 
equality holds if and only if x = y = z. 

Since (4.16) is a valid inequality, and since x + y + z > 0, it fol- 
lows that the left-hand side of (4.15) is also >0; that is, the inequality 
(4.14) is valid. Since (4.14) is equivalent to (4.11), we have now proved 
that the arithmetic-mean-geometric-mean inequality is valid for 
means of three numbers; the condition x = y = z under which the 
sign of equality holds in (4.14), and therefore in (4.11), is equivalent 
to the conditiona=b=c. 


(f) The Arithmetic-mean-Geometric-mean Inequality for n Numbers. 
Emboldened by this success, let us conjecture that the results we 
have obtained for means of two and three numbers are merely spe- 
cial cases of a general theorem valid for any number of positive 
quantities. If this conjecture is true, then we have the following result: 


THEOREM 4.3. For any n nonnegative numbers ay, @z,....An, 
a a eee a. 
(4.19) Be Os aaake ae. 
n 


The sign of equality holds if and only if a, = ag = +++ =a. 


This is the famous inequality connecting the arithmetic mean of n 
quantities with the geometric mean of the » quantities, and is, indeed, 
true. We have concentrated on this inequality for several reasons. In 
the first place, it is a fascinating one, and one that can be established 
in a large number of interesting ways; there are literally dozens of 
different proofs based on ideas from a great variety of sources, In 
the second place, it can be used as the fundamental theorem of the 
theory of inequalities, the keystone on which many other very impor- 
tant results rest. In the third place, as you will see in Chapter 5, we 
can use some of its consequences to solve a number of maximization 
and minimization problems. 

In attempting to prove the general inequality, a first thought may 
be to continue along the preceding lines, using another algebraic 
factorization for n = 4, still another one for n= 5, and so on. 
But this approach is neither attractive nor feasible. As a matter of 
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fact, no simple proof along these lines exists. 

Instead, we shall present a simple proof based on two applications 
of mathematical induction; one, a “forward” induction, will lead to 
the desired result for all the integers n that are powers of two, i.e., 
for n = 2*; the other, a “backward” induction (from any positive 
integer to the preceding one), together with the forward induction 
will enable us to establish the result for all positive integers n. 


(i) Forward Induction. The method we shall now employ will 
illustrate an amusing variant of the fundamental technique of proof 
by mathematical induction, which we have previously discussed in 
Sec. 2.6 (Chapter 2). 

Let us start with the result for n = 2, namely 


(4.20) aah > Vab, 


which, by Theorem 4.1, is valid for all nonnegative a and b, and use 
some mathematical ingenuity. Although there are very many simple 
proofs of Theorem 4.3, they all possess the common ingredient of 
ingenuity. 
Set 
pas BAS b = Bt 
2 

where @), dz, 43, a4 are nonnegative numbers. Substituting these 
values for a and b in (4.20), we obtain the inequality 


Gi 02, Gas 
2 2 > a, + Q2\ /a3 + a4 
2 fe 2 2 i 
or 


(4.21) Qa; + a2 + a3 + a4 > (# + “\(* + *). 
a 2 


Since the left-hand side of the inequality (4.21) has the desired form 
(see Theorem 4.3), let us concentrate on the right-hand side. Using 
the valid inequalities 


(4.22) ose > Vaiae2, ase > Va304, 
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and the transitivity rule (Theorem 2.1), we obtain from (4.21) the 
further inequality 


(4.23) a Fe Tare V Vaia2 \laza, > (a,a2a304)"4. 


But this is precisely the desired result for four nonnegative quanti- 
ties! The arithmetic mean is greater than or equal to the geometric 
mean for n= 4. 
The sign of equality holds in (4.21) if and only if 
a, + a2 _ % + a4 

D) > 
and in (4.22) if and only if a; = a2, a3 = a4; consequently, it holds 
in (4.23) if and only if a; = a2 = a3 = a4. 
Nothing stops us from repeating the foregoing trick. Set 
— dit be an = Oat bs a, = U5 + 46 

5) ’ = 5} ’ 3 5) 

where the 5; (i = 1, 2,...,8) are nonnegative quantities. Substitut- 
ing in (4.23), we have 


bi + bot +++ bay Gi" b3 + bg *2 Ph) bz + bg\|1/4 
8 7 2 2 2 2 


Using the inequalities 


bb +b br +b 
aS Vik. ..., “E> Vos, 


— 51 + bs 


ay ’ a4 5 ’ 


and the transitivity rule, we obtain 


Pit Ppt ts > (VPibe VBaba VBsbe VBrbe)!* 
> (bib2 +++ bg)'8, 


the desired result for eight quantities. The sign of equality holds if 
and only if all the b; are equal. 

Continuing in this way, we clearly can establish the inequality for 
all values of 1 that are powers of two, i.e., for = 2, 4, 8, 16,.... 
To tie the result down rigorously, we use mathematical induction. The 
main step consists of proving the following result: 


The arithmetic-mean-geometric-mean inequality is valid for alln of 
the form 2*, k = 1,2,.... 
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Proor. We already know that the result is true for n = 2 = 21,i.¢., 
for k =1, and indeed for nm = 2? and 23, Let us assume that the 
result is true for an integer n of the form 2*, and then establish it for 
2k+1. Since 2*+1 = 2-2, this means that we shall prove that the 
result is true for 2”. 
Thus we have assumed that 


(4.24) ee So > (aya +++a_)m 
for any set of nonnegative quantities a1, @2,...,@,, where n= 2‘, 
Choose a; (/ = 1,2,...,”) to have the following values: 
b bg + b bon-1 + Bb 
a, = at BS - pee af a ae a zat 4 an 


where the 27 numbers 5; (j = 1, 2,...,2”) are given nonnegative 
numbers, and substitute in (4.24). Proceeding as before, we finally 
obtain 


by + be + +++ + bon 


Fe > @ibo+++ ban)”. 


As before, the sign of equality holds if and only if all the 6; are 
equal. Hence we have established the desired result for 2m, or 2*+1, 

Thus, since the inequality is valid for k = 1, the principle of 
(forward) mathematical induction asserts that the result is true for 
all positive integers k, and hence that the inequality (4.24) holds for 
all n that are powers of two. 


Gi) Backward Induction. Now that we have established the 
result for those integers that are powers of two, how do we establish 
it for the full set of positive integers? 

Another procedure is required. Consider the case n=3, for 
which, of course, we have already established the result by a differ- 
ent method. Using the relationship for n = 22 = 4, 


(4.25) a > (ayaeaga4)"4, 


which has already been established by forward induction, let us see 
if we can derive the corresponding result for n = 3. 
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We accomplish this by means of the important technique of spe- 
cialization. Starting with (4.25), we choose the quantities a1, a2, a3, 
and a, in a special way. Set 


(4.26) a;=b;, ag=b2, a3= bs, 
and ask for the value a, that yields the equality 


a+ a2+ a3 +44 _ b, + bo + bg 
4 = 3 : 


By the values given in (4.26), this requires that 


by +bo+b3 +44 b+ 62 + b3 
4 ~ 3 . 


whence 


by + bp + b3 


ay = S(b1 + be + bs) ~ (br + be + bs) = : 


Substituting these particular values for the a; in (4.25), we derive the 
relationship 


b, + bo + bg > Vb xbabs ( + bo + 2) 
3 oe . 3 : 


Raising both sides to the fourth power, we obtain 
(bet bet dV! > bypyps (2th + bs), 


or finally, dividing by (6; + bz + 53)/3, 
(AAP tO) > bybabs, 


which is equivalent to the desired result 


(4.27) Prt Pat Os > Wb Babs. 


Since equality holds in (4.25) if and only if a, = a2 = a3 = az, it 
follows that equality holds in (4.27) if and only if b; = bz = 63. 

In order to extend this method to the general case, we shall employ 
an inductive technique, but an inductive technique of nonstandard 
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type. Instead of proving that if the result holds for n then it holds for 
n+ 1, we shall prove that if it holds for n then it holds for n— 1. 
Since we already know that it holds for n = 2* (k = 1,2,...), this 
method will yield the theorem in all generality. 

Let us now show that if the result holds for n, then it holds for 
n — 1. To do this, we repeat the trick of specialization that we used 
above. Let 


(4.28) ay = b,, a2 z= be, ee aAn-1 = b = 
and determine a, by the requirement that 


A+ Goteret+ a, bi + bo teret+ baa, 


= : 


n n—1 
using the values (4.28) and solving for a,, we get 


_ 6, + be +o0+ + On 
(4.29) ag = tbe tt Oot, 


We have assumed that the inequality 


ai + atest any Yayag- +n 


n 


for the n nonnegative quantities a;,@2,...,@, is valid; substituting 
the values (4.28) and (4.29) for the a;, we have 


by + be + set bn y 
n—-1 aa 


Raising both sides to the nth power and simplifying, we obtain the 
inequality 


bib2 +++ bps 


b b ooo ba n-1 
bat be tor + bat) > bibe++*Bn_1, 


which is equivalent to the desired result 


bit ba tere Peet YER Bea. 
= 142 °°* Un-1- 


n—1 


As before, equality holds if and only if 6; = by = +++ = ba_1, and 
the proof of Theorem 4.3 is complete. 
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4.3 Generalizations of the Arithmetic-mean- 
Geometric-mean Inequality 


We shall now show that a number of results that appear to be 
generalizations of the fundamental arithmetic-mean-geometric-mean 
theorem, derived above, are actually special cases. 

First. let us take the arithmetic-mean-geometric-mean inequality 


X Xx eee xX 

Mee > (x1XQ0+ + X_)n 
and set the first m of the numbers x; equal to the same nonnegative 
value x, and the remaining n — m equal to a common nonnegative 
value y, that is, 


Xp SH XQS ces SH] Xm KX, Xm+1 = Xm42 =e HS Xn =. 
The arithmetic-mean-geometric-mean inequality for x1, x2,...,Xn 
becomes 


mx +(n — m)y renee 
ey ) 
or 


mx +(I _ m y> xm /nyl—min | 
n nf 


Here n is any positive integer, and m is any integer in the range 
l1<m<n-—1. It follows that m/n can represent any rational frac- 
tion r occurring in the interval O<r< 1. Let us then write the 
foregoing inequality in the form 
(4.30) rmx + (l— ny > xy, 
a most important result for our subsequent purposes. 

This inequality (4.30) is valid for any two nonnegative quantities 
x and y, and for any fraction r between 0 and 1. Equality occurs if 
and only if x =y. 

Let r be denoted by I/p. Since O< r<l, wesee that p>1. 
Then 

eg eee andl 
P P 

Let g denote the quantity p/(p — 1), so that 1/g=1—-—r and 
1 =I. 
p 


+ 
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The inequality (4.30) then has the form 


x,y 
4.31 X 42> xrvoyia, 
(4.31) +4 44 


To eliminate the fractional powers, set 
(4.32) x=a, y=bd9, 
Then (4.31) assumes the form 


(4.33) 22> ab, 

PB 4 
where a and b are nonnegative numbers, and p and q are rational num- 
bers satisfying 1/p + 1/q¢ = 1. There is equality if and only if 


(4.34) aP = bf, 


Once we have defined what we mean by irrational numbers and 
by functions of the form x’, where r is irrational, we can show, 
either directly or by means of a limiting procedure starting from the 
inequality (4.30), that the inequality (4.30) is actually valid for all r 
between 0 and 1, ‘and therefore that (4.33) is valid for all p> 1, 
q> 1 satisfying 1/p + 1/qg = 1. If you wish to pursue this refine- 
ment further, you should first read Numbers: Rational and Irrational 
by Ivan Niven, to which we have already referred in Chapter 1. 


Exercises 


vee 


. Show that if the values yi, yo,...,y, are all nonnegative, and if mm, mo, 
-.+,Mm, are positive integers, then 
myy1 + Mayo + +++ + MY 
m, + mz + ++ +m 
Show consequently that if ri,r2,...,r% are proper fractions satisfying 
ntretetm=l, 


> Ori™ye2™ oe oye) +mateoe+ my) 


then 
yr + raya tee + riya D> yrliyor2 .. Yee. 


2. An important average in statistics is the root-mean-square. For two non- 
negative numbers a and b, the root-mean-square is the value 


- {®#+B 
s= 5} . 


For the pairs (5, 12), (0, 1), and (p, p), compute the arithmetic mean and 
the root-mean-square. 
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Show that the arithmetic mean of two positive numbers is less than or equal 
to their root-mean-square: 


a+b eR 
y <V 7 


Under what circumstance does the sign of equality hold? How does the 
root-mean-square compare with the geometric mean and with the har- 
monic mean? 


. Let ABDC be a trapezoid with AB =a, CD = b (see Fig. 4.4). Let O 


be the point of intersection of its diagonals. Show that 


Figure 4.4. G tric illustration of —242_ a+b a® + b? 
ig eometric illustration o} p= Ves 5 < 5 


(a) The arithmetic mean (a + 6)/2 of a and b is represented by the line 
segment GH parallel to the bases and halfway between them. 

(b) The geometric mean \/ab is represented by the line segment KL 
parallel to the bases and situated so that trapezoids ABLK and KLDC are 
similar. 

(c) The harmonic mean is represented by the line segment EF parallel 1o 
the bases and passing through O. 


(d) The root-mean-square is represented by the line segment MN parallel 
to the bases and dividing the trapezoid A BDC into two trapezoids of equal 
area. 


4.4 The Cauchy Inequality 
(a) The Two-dimensional Version: (a2 + b?)(c2 + d?) > (ac + bd)?. 


Let us now introduce a new theme. As in a musical composition, this 
theme will intertwine with the original theme to produce further and 


more beautiful results. 


We begin with the observation that the inequality 
a? + b? > 2ab, 


on which all the proofs in the preceding sections of this chapter were 
based [see Sec. 4.2(b)], is a simple consequence of the identity 


a? — 2ab + b? = (a — b)?, 
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which is valid for all real a, b, not merely for nonnegative a, b. 
Consider now the product 


(4.35) (a? + b?)\(c? + d?). 
We see, upon multiplying out, that this product yields 
a2c2 + b2d2 + a?d? + brc?, 
which is identically what we obtain from expanding the expression 
(4.36) (ac + bd)? + (bc — ad). 
Hence we have 
(4.37) (a2 + Bb?) (c2 + d?) = (ac + bd)? + (bc — ad). 
Since the square (bc — ad)? is nonnegative, from (4.37) we obtain 
(4.38) (a? + b?)(c2 + d?) > (ac + bd)?, for all real a, b, c, d, 


a very pretty inequality of great importance throughout much of 
analysis and mathematical physics. It is called the Cauchy inequality, 
or, more precisely, the two-dimensional version of the Cauchy 
inequality. 

Furthermore, we see from (4.37) that the sign of equality holds in 
(4.38) if and only if 


(4.39) be — ad = 0. 


In this case, we say that the two pairs (a, b) and (c, d) are proportional 


to each other; if c“0 and d=0, the condition (4.39) can be 
written as 


(b) Geometric Interpretation. On first seeing the identity of the 
expressions (4.35) and (4,36), the reader should quite naturally and 
legitimately wonder how in the world anyone would ever stumble 
upon this result. It strikes him as having been “pulled out of a hat,” 
a piece of mathematical sleight of hand. 

It is a tenet of a mathematician’s faith that there are no accidents 
in mathematics. Every result of any significance has an explanation 


+A generalization of this inequality to expressions occurring in integral calculus 
was discovered independently by the mathematicians Buniakowski and Schwarz. The 
name “Cauchy-Schwarz inequality” is sometimes applied to the inequality in the text, 
but more particularly to its generalization. 
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which, once grasped, makes the result self-evident. The explanation 
may not immediately be obvious, and it may not be found for some 
time. Often the significance of a mathematical theorem becomes clear 
only when looked at from above—that is to say, from the standpoint 
of a more advanced theory. But the meaning is always there. This is 
a vitally important point. Were it not for this, mathematics would 
degenerate into a collection of unrelated formalisms and parlor tricks. 
Frequently, the simplest interpretation of an algebraic result is in 
terms of a geometric setting. Formulas that seem quite strange and 
complex become obvious when their geometric origin is laid bare. 


fo) 
Figure 4.5. Geometric interpretation of the Cauchy inequality 


Consider the triangle shown in Fig. 4.5. The lengths of the seg- 
mentst OP, OQ, and PQ are given by 
OP = (a2 + b%)1/2, 
OO = (c2 + d2)12, 
and 
PQ =[(a—¢? + (b~ dy]?, 


respectively. = 
Denote the angle between OP and O@ by @. By the law of cosines, 
we have 
(PQ)? = (OP)? + (OQ)? — 2(OP(OQ) cos 8. 
Substituting the values of OP, OQ, and PQ, and simplifying, we 
obtain 


ac + bd 


(4.40) cos 9 = (a2 + b%)12 (c2 + d2)12" 


+The length of a segment XY, often written XY, will be written XY in this book for 
reasons of typography. 
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Since the cosine of an angle must always lie between —1 and +1, 
we must have 


(4.41) 0 < cos?@< 1. 
Squaring both sides of (4.40) and (4.41), we obtain 
eee (ac + bd)? 4 


(a? + b?\(c? + d2) — . 
and finally 
(a2 + b?)(c? + d?) > (ac + bd). 


This is again the two-dimensional version of Cauchy’s inequality 
(4.38), which seemed so magical in its algebraic setting. 
Furthermore, we see that there is equality if and only if cos?@ = 1, 
ie., if and only if @ is a zero or straight angle—that is to say, if and 
only if the points O, P, and Q lie on the same line. In that case, we 
must have an equality of slopes; in other words, unless c=d=0, 


(c) Three-Dimensional Version of the Cauchy Inequality. An advan- 
tage of an interpretation of the foregoing type lies in the fact that we 
can use geometric intuition to obtain similar results in any number 
of dimensions. 

Turning to three dimensions, let P:(@, a@2,a3) and @Q:(b,, be, b3) 
be points distinct from the origin O:(0,0, 0). Then the cosine of the 
angle @ between OP and OQ is given} by 


4;b; + aeb2 + azb3 


6 = = SS 
ke (a2 + a2? + 32)? (by? + by? + 32)? 


which, together with the fact that cos?@< 1, implies the three- 
dimensional version of the famous inequality of Cauchy: 

(4.42) (Gi? + ao? + a3?)(by? + bo? + 53?) > (Gib1 + a2b2 + agb3)?. 
The sign of equality holds if and only if the three points O, P, and 


¢ For a derivation see W. F. Osgood and W. C. Graustein, Plane and Solid Analytic 
Geometry, Macmillan and Co., New York, 1930. 
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Q are collinear; this is expressed by 


a, _ a2 _ a3 


bets Ba! 


provided the b’s are all different from zero. 


(d) The Cauchy-Lagrange Identity and the n-Dimensional Version of 
the Cauchy Inequality. A strictly algebraic proof of the three-dimen- 
sional Cauchy inequality (4.42) may be obtained by noting that 
(a1? + ag? + a3?)(by? + be? + 53?) — (ayby + debe + a3b3)? 

= (a y7bg? + d27by?) + (a;2b3? + a37by?) + (a22b3? + a3?b2?) 
(4.43) — 2ayb,a2b2 — 2ayb,a3b3 — 2agbeasbs 
= (@1bz — azb)? + (ayb3 — ash)? + (a2b3 — azb2)?. 

The last expression in (4.43) clearly must be nonnegative, since it 
is a sum of three nonnegative terms. Hence 

(a? + a2? + a3?)(by? + be? + bs?) — (arbi + a2b2 + agb3)? > 0, 


and thus Cauchy’s inequality for three dimensions is proved again. 
The identity (4.43) yields both the inequality and the case of equality; 
the last expression in (4.43) is zero if each term vanishes, i.e., if the 
a’s and b’s are proportional. 

When you study the analytic geometry of three dimensions, you 
will see that the identity (4.43) is simply the result 


cos?@ + sin2@ = | 
written in another way. 
The identity (4.43) can be generalized as follows: For any set of 
real quantities a; and b; (= 1.2,...,”), we have 
(a1? + ao? + +++ + Gn? (bi? + bo? + +++ + by?) 
(4.44) — (ayby + debe + +++ + Anbn)? 
= (a,b2 — agb,)? + (a1b3 — d3by)? + +++ + (@n—18n — Andn_1)? 5 


this is the famous identity of Cauchy and Lagrange. From this identity 
we obtain the n-dimensional version of the Cauchy inequality 


(ay? + ay? + +++ + ay2)(bi? + be? + +++ + by?) 


(4.45) > (a,b, + dobo + +224 Anbn)? , 


valid for all real values of the a; and 4;j. 
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(e) An Alternative Proof of the Three-Dimensional Version. The 

proofs that we have given above were perfectly satisfactory as far as 

the results at hand were concerned. However, they do not generalize 

in such a way as to yield some results that we wish to derive later on. 
Consequently, we start all over again and introduce a new device. 

Let us begin with our basic inequality (x — y)? > 0 in the form 


x? oy 
(4.46) > + 5) > xy, 


which is valid for all real x and y, and substitute, in turn, 


ge ay pened | eee 
~ (ay? + ae? + a32)1/2’ is (by2 + be? + b32)1/2’ 
then 
oe ao eae Raat: Serra 
~ (a;? + a2? + a32))/2’ Ve (by2 + bo? + b3?)1/2 . 
then 
(ay2 + G22 + a32)'/2’ y (bi? + bo? + b32)22’ 
where the a; and 5, are all real quantities. Adding the three inequali- 
ties obtained in this way, we find 


ee + a2? + a) 1 (bi? + bo? + 72) 
2\a1? + ao? + as? 2\b1? + be? + bs? 
> ab, + aob2 + a3b3 
= (ay? + ao? + a32)1/2(b,2 + bo? + 532)?” 
which, very obligingly, yields the inequality 
I> ayb, + aob2 + azb3 
— (ay? + ao? + a32)1/2(5,2 + bez + b32)1/2 , 


equivalent to Cauchy’s inequality (4.42) [subsection (c)], as desired. 
The n-dimensional version (4.45) of Cauchy’s inequality may be 
proved in an analogous manner. 


4.5 The Hélder Inequality 


We now possess all the tools necessary to fashion one of the most 
useful inequalities of analysis, the Hélder inequality. This states that 
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for any set of nonnegative quantities a; and 6; (i = 1,2,...,n), we 
have 


(44? + Ag? + 00+ + ay?) /P(Dyo 4 bol Howe f b,4)1/4 


4.47) 
¢ > aby + aobz + +++ + andr, 


where p and q are related by the equation 


(4.48) 


and p> 1. 

The case p = q = 2 is the Cauchy inequality established in the 
preceding sections. In the general case, however, we must restrict our 
attention to nonnegative a; and bj, since fractional powers p and q 
might be involved. 

Actually we shall prove (4.47) only for rational numbers p, g; how- 
ever, the result is valid also for irrational p and q. 


We begin with the eee! 


Pal > ab, 
Pr +4 
which we established in Sec. 4.3; see (4.33) for rational p, g and non- 
negative a, b. Then we use the trick employed in Sec. 4.4. We set 


ay a by 


net Saeed be ee. 
(QyP + oP + +++ + ApyP)l/P (by9 + bot 4 +++ + bya)1/4 
then 
ee. ee ped oa es 
(QyP + AgP + ++ + dpP)/P’ (by4 + bot + +6 + b,1/9” 


and so on, and add the resulting inequalities. We thus obtain 


lL fay? + ap? + +++ + Gn? pet Pal tt bal 
P\GP + GoP + 02+ + a,? qQ\b19 + bo + +++ + by 


(4.49) 
ayb, + Agbg + +++ + Anda 


2 Gp + dae toee te a, ob, + bot + +06 + byM1/0’ 


or. finally, using (4.48), the result stated in the inequality (4.47). 
Equality can occur in (4.49) if and only if 5;?/a;? has a common 
value for all i. 
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4.6 The Triangle Inequality 


We are all familiar with the geometric theorem that the sum of the 
lengths of two sides of a triangle is greater than or equal to the length 
of the third side. Let us see what this implies in algebraic terms. 

Take the triangle to be situated as shown in Fig. 4.6. Then the 
geometric inequality 

OP + PR>OR 


is equivalent to the algebraic triangle inequality 


(4.50) Vx Fy + Vx22 + yo? > VOa + x2)? + Ci + y2)- 


R: (x)#X5, y,+ Yo) 


Figure 4.6. The triangle inequality 


Can we establish the triangle inequality (4.50) without resorting to 
geometry? In the one-dimensional case this was done in Sec. 3.8 (see 
Theorem 3.2), where the inequality was written in the form 

[xa] + [x2] 2 [x1 + x2! 
more often than in the equivalent form 
Vx? + VX2? > V(X + X2)*. 


Now, the simplest way of proving the two-dimensional triangle 
inequality (4.50) is to establish the validity of an equivalent inequality. 
To this end, we square both members of (4.50), obtaining the equiva- 
lent inequality 


ma + yr? + aa? tye? + VE EE VEE 
> (x1 + X2)? + O01 + y2)?, 


which is readily seen to be equivalent to 


(4.51) Vx1? + yi? \/x2? + yo? > xix2 + yiy23 
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but this is a simple consequence of the familiar Cauchy inequality 
{two-dimensional version, see (4.38)] 


(4.52) (x12 + y'1?)(x2? + yo?) > (x1x2 + yiy2)?, 


and thus the triangle inequality is established. 

As in the one-dimensional case (Theorem 3.2), the discussion of 
the conditions under which the sign of equality holds in the triangle 
inequality (4.50) is just a bit involved. You will recall that in the 
Cauchy inequality (4.52) the sign of equality holds if and only if 
(x1, 1) and (x2, y2) are proportional, i.e., x; = kx2, v1 = ky2. Now 
(4.51) is obtained from (4.52) by taking a square root of each mem- 
ber; this is a valid operation since it is the nonnegative square root of 
the left-hand member that was taken. But if x:x2 + yyye < 0, 
that is to say, if x:x2 + yyye is the negative square root of the 
right-hand member (x1x2 + yyy2)? of (4.52), then the strict inequality 
holds in (4.51) even though (x1, y1) and (x2, y2) are proportional. Thus 
the sign of equality holds in (4.51), and hence also in the triangle in- 
equality (4.50), if and only if x, = kx: and y; = ky with a non- 
negative constant k of proportionality.+ 

Geometrically, the foregoing necessary and sufficient condition for 
the sign of equality to hold in the triangle inequality (4.50) has the 
interpretation that the points O, P, and Q in Fig. 4.5 be collinear and 
that P and Q be on the same side of O. Then the triangle OPR col- 
lapses. In this case, we say not merely that P and Q are collinear with 
O, but that they are on the same ray issuing from O. 

You might check that the foregoing discussion is consistent with 
the discussion of the corresponding condition in the one-dimensional 
case |a|+|b|>|a+ bj, namely, that the equality sign holds if and 
only if a and b are of the same sign. 

Our proof of the triangle inequality can be generalized, following 
the pattern set in the proof of the Hélder inequality, to yield 


VE Fa Ed ae + VPP EE ee 
Pa (X1 +1 + (x2 + ya) Se Xn + Yn ’ 
valid for all real x;, y;, and with the sign of equality holding, as before, 


if and only if the x; and the y; are proportional with a positive factor 
of proportionality. We shall return to this inequality in Chapter 6, 


+ For example, (—3.4) and (6, —8) are proportional with a negative constant of 
proportionality, while (—3, 4) and (—6, 8) have a positive constant of proportionality. 
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when we consider its geometric significance. 

An alternative proof of the triangle inequality, which can be, 
extended to give a more general result, is the following. We write the 
identity 

(x1 + X2)? + (1 + y2)? 

= X4(X1 + X2) + yilyr + Ye) + X2(X1 + X2) + Yo(1 + ye) 


and apply the Cauchy inequality in square-root form [see eq. (4.51)] 
separately to the two expressions 


X1(x1 + X2) + Yili + ¥2) 
and 
X2(x1 + X2) + yo( yi + ye), 
obtaining 
(x12 + yr?)!/2f(e, + x2)? + C1 + ye)? 2? 
> X1(xX1 + X2) + i(y + y2), 
and 


(x2? + yo?)/2[(x1 + x2)? + (yr + y2)?}2 
> Xo(x1 + X2) + yo(yr + ye). 
Adding, we have 


[Cxa? + ys? + (xa? + yo)7I ler + 20)? + On + 2"? 
> (x1 + X2)? + Ci + ye)?. 
Dividing through by the common factor [(x1 + x2)? + (1 + y2)?}!?, 
we get 
(x1? + yr2)M/2 + (x2? + yo?)/2 D> [x + X02)? + (1 + yo)?]?. 


Thus we have again established the triangle inequality (4.50). 
Returning to the point where the Cauchy inequality was applied 
in square-root form, we see again that equality occurs if and only if 


X=kxe, yi=he 


for some nonnegative factor of proportionality k—that is to say, if 


and only if the three points O, P, and Q are collinear, with P and Q 
on the same side of O. | 
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4.7 The Minkowski Inequality 


We now possess all the tools and devices to establish another 
famous inequality, that due to Minkowski. It asserts that for any set 
of nonnegativef quantities x1, y1, X2, ye, and any p> 1, we have 
(xr + yiP)YP + (xa? + yap)? 

> [C41 + x2? + 1 + yor]? 
The triangle inequality is the special case p = 2. 

The proof of Minkowski’s inequality is similar to that just given 
for the triangle inequality, with the difference that the more general 
Hdlder inequality (Sec. 4.5) is used in place of Cauchy’s inequality. 
Write the identity 


(x1 + X2)? + (i + yoy? = [xalea + x2)!!! + yi( + y2)?73] 

+ [xoQe1 + xX2P7) + yo + yor 3] 
and apply Hdlder’s inequality to each of the terms separately. The 
results are 

(ea? + yaP)Al(xer + x2)O-O0 + (ya + ya)O“Va]Va 
> Xy(%1 + X2)P7} + yi(yi + y2)P™} 


(4.53) 


and 
(x2? + yor) P[(x1 + x2)O~Me + Cy + poje Daya 
> Xo(X1 + X2)P71 + yo(yr + yo)P?. 


Since I/p + 1/g = 1, wesee that (p — lq =p. Adding, we obtain 
[er + x2)” + Orr + ye Cer? + YP)? + (xe? + yo?) 
> (x1 + x2? + (i + yoy; 
and then dividing by [(x1 + x2)? + (1 + y2)P)]}/7, we get 
(Xa? + yiP)/P + (x2? + yor)? > [(x1 + x2)? + Cri + yopP]i4 


which, since 1 — 1/qg = 1/p, is the stated Minkowski inequality 
(4.53). 

The sign of equality holds in the Minkowski inequality if and only 
if it holds in the Hélder inequality [by means of which (4.53) was 
proved], that is, if and only if the points (x1, y1) and (x2, y2) [which 
are in the first quadrant} are collinear. 

+ The restriction to nonnegative quantities is necessary because, in general, fractional 
powers p and q are involved. 
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As you might surmise from the foregoing generalizations of the 
Cauchy, Hélder, and triangle inequalities, the general Minkowski 
inequality for sets of m nonnegative numbers 


X1,X2,-..,Xn and =-4, Y2,-- Yn 
is 
[x1? + Xo? + eee + XpP]MP + Lyi? + yor + ooe + yaP]i/P 
> [Oa + ya? + (2 + yoy? + o++ + On + Yn)P]? 
for p > 1. The inequality sign is reversed for p< 1. 


4.8 Absolute Values and the Classical Inequalities 


The arithmetic-mean-geometric-mean inequality, the Cauchy in- 
equality, the Hélder inequality, the triangle inequality, and the 
Minkowski inequality are the classical inequalities of mathematical 
analysis. For ready reference, they are collected in Table 3. 


TABLE 3. The Classical Inequalities for Nonnegative Values 


Name Inequality 
Arithmetic-mean- 
geometric-mean Qt ap F-** + Gn > (aya2-++a,)" 
inequality ” 
Cauchy (ay? + Gq? + +++ + Gq?)"/?(b1? + bp? + +++ + bg?)1? 
inequality 2 4yby + aabz + +++ + Anbn 
Hélder (41? + 2? + + + AqP)?/P(b17 + bot + +00 + 5qi)1/G 
inequality > ayby + Gabe + +++ + ands 
Triangle (G1? + G2? + +++ + Gq?)¥/? + (by? + ba? + 00+ + by?)? 
inequality > [far + 51)? + (G2 + bo)? + +0 + (Gn + by)*}2 
Minkowski (ay? + GaP + +++ + agP)YP + (by? + bo? + eee + bap)? 
inequality > [ai + by)? + (a2 + be)? + 08 + (Gn + ba)?]/? 


The inequalities are valid for any nonnegative values aj, dz, ..., Gy 
and 6, b2,...,5n; for arbitrary p>1; and for 1/p+ 1/q=1. 
The Cauchy inequality and the triangle inequality constitute the 
special case p= 2 of the Hélder inequality and the Minkowski 
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inequality, respectively. The sign of equality holds if and only if all 
the a; are equal, for the arithmetic-mean—geometric-mean inequality: 
the sets (a;), (6,9) are proportional, for the Hélder inequality; and the 
sets (a;), (bj) are proportional, for the other inequalities. 

The foregoing inequalities are primarily concerned with nonnega- 
tive values. However, the absolute value of any real number is non- 
negative, so that the inequalities apply, in particular, to the absolute 
values of arbitrary real numbers. This observation can be extended 
somewhat by means of the result in Theorem 3.2, which states that 
a sum of absolute values is greater than or equal to the absolute value 
of the corresponding sum; thus, relative to the Minkowski inequality, 
we have 


lai] + |i] > ai + dil, 


the sign of equality holding if and only if a; and 5, are of the same 
sign. The extended inequalities are exhibited in Table 4. 


TABLE 4. The Classical Inequalities for Arbitrary Values 


Name Inequality 


Arithmetic-mean- 


geometric-mean dan) + faa} e+ +140] 5 pgiag ees aq| tn 
inequality “ 

Cauchy (ay? + G2? + +++ + aq2)/2(b 12 + bo? +o 0+ + by?)Y? 
inequality > aby + aby + +++ + Anbn 


(lar|? + fae]? + +++ + Lan?) 


Holder o([Oi 9 + [bof tenet {bn | 94/4 
inequality 
> ayby + adgbz + +++ + anbn 
Triangle (412 + G2? + ++ + an2)1/2 + (by? + bo? 4 +++ + dy?l/? 
inequality > (Car + b1)? + (a2 + be)? + 00+ + (An + bn)?}? 
(lar |? + lae[? tee + fan [eye 
Minkowski + (bie + [bo]? tee + [bn [Py 
; li 
Peaedy ner > [lar + bi]? + Jaz + beh tee tld + Onl}? 
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For these inequalities involving arbitrary real values a1, a2,...,@n 
and by, be, ..., bn, the sign of equality holds if and only if all the |a;| 
are equal, for the arithmetic-mean-geometric-mean inequality; the 
sets (|a;|), (|5:|% are proportional and each a,b; nonnegative, for the 
Hdlder inequality; and the sets (a,), (b;) are nonnegatively propor- 
tional, for the other inequalities. 
As an application, for given sets of values (x1, X2,..., Xn); 
(Vi. ye; see Yn), and (21, 22,005 »2n), let 
mee ay = 2% — Yi and & =yj— %. 
Q{+th=4-y+V— MHA XM, 


and the extended Minkowski inequality yields 


(|Z1 — pil? +122 — yo]? + 2+ + [Zn — ynlP)¥/P 
+ (ys — Xa]? + lye — x2]? + ++ +1 yn — Xn)? 
> (|21 — Xa]? + [22 — x2]? +++ + [Zn — Xn[P)P; 


equality holds if and only if the sets (z1 — y1, Z2 — yo,..-,2n — Yn) 
and (yi — Xn ye — X2,.-+5Yn — Xn) are proportional with a non- 
negative constant of proportionality. 


4.9 Symmetric Means 


You might think, by this time, that the subject of inequalities has 
been pretty well exhausted. Actually, just the opposite is true. All 
that has gone before represents the barest scratching of the surface. 
The inequalities that we have established can be generalized in count- 
less ways, and then these generalizations in turn can be generalized. 
They can be combined in many ingenious ways to give many further 
results, as we shall indicate in this section and in Sec. 4.10. 

For example, one extension of the arithmetic-mean-geometric-mean 


inequality that is rather interesting is the following: Form the three 
means, 


Xy + X2 + X3 
mM =, 


— (X1%2 + X1X3 + X2x3\V/? 
ic Caen ay 


m= (x1X2x3 1/3, 
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Then the continued inequality 


m, > M2 > m3 


is valid for all nonnegative quantities x1, x2, and x3. We leave the 
proof of this for the reader. This, in turn, is a special case of the analo- 
gous result valid for any n nonnegative quantities, in which there are n 
mean values starting with the arithmetic mean and ending with the 
geometric. 


4.10 The Arithmetic-Geometric Mean of Gauss 


Let a and 5 be two nonnegative quantities, and consider the 
quantities a, and 5; defined in terms of a and b by 


-~4t+b b= Va. 


a= 2 ’ 


By our basic Axiom II, page 7, a, and b, must also be nonnegative. 
If we assume that @>b, we have 


a < fifa, b> VP =b. 


Furthermore, by the arithmetic-mean-geometric-mean inequality, — 
ay > by . 


Let us now repeat this process, introducing a2 and bz by means of 
the formulas 


a= mee, be = Vayb; 7 
Then, by the same reasoning as above, 
a>a> a, 
b < by < bp ’ 
and 
ae > bo ; 


Let us now continue this process, defining 


b 
a3 = ee, b3 = Vaeb2, 
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then a4, bg, and generally ap, b,, by the recurrence relationships 


(4.54) Pere es Gn. SE YP 


2 
After the kth step of our process, we have numbers aj, @2,..., a, and 
by, be, ..., by satisfying the inequalities 


a>a>ag>-s >a >be > bk > D>. 


For example, if a= 4 and b= 1, then the first few a’s and 6’s are 
represented on the line shown in Fig. 4.7. We observe that b is the 
smallest and a the largest of all these numbers, and that all the b’s 
are smaller than all the a’s. 


bz a2 
b b,\ J Q, a 
fc) ' 2 3 4 
Figure 4.7. The arithmetic-geometric mean of Gauss 


+ 


We need not stop after k steps. Suppose we continue defining more 
and more a’s and b’s by the relationships (4.54). Each pair aj, bj, so 
defined, is sandwiched between the previous pair aj_1, bj1. Thus it 
is plausible that the quantities a,, which decrease as n gets larger but 
which must always remain bigger than each };, approach some 
fixed value A; similarly, the b,, which increase as n gets larger but 
which remain smaller than each a;, approach a fixed value B. The 
reader familiar with the concept of limit will readily see that A is the 
limit of the infinite sequence of numbers {a,} and B is the limit of 
the infinite sequence {b,}. 

Moreover, the difference a, — b, becomes rapidly smaller as n 
increases. We can show that, at each step, this difference is less than 
half of what it was at the previous step. Just write, using (4.54), 


(4.55) nga ~ Dag = SPs SOY ree ae 2varbs t+ bn 


Since V/b,< Van, we have 
2b, <2 Vanbn , 
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adding a, — b, — 2\/anb, to both sides of this inequality, we obtain 


Ayn — 2Vanbn + bn <n — bn 
which, when applied to (4.55), yields 


Ans — Ong < + (an — bn) 


as we set out to show. 
Since the a’s and b’s approach each other in this manner, their 
limits A and B must coincide. So 


A=B, 


and this common limit depends only on the numbers a and b with 
which we started; in mathematical language, A is a function of a and 
b, It was shown by Gauss that this function is not just a curiosity but is 
rather basic in analysis; it may be used to found the branch of 
mathematics called the “theory of elliptic functions.” 


CHAPTER FIVE 


Maximization and Minimization 
Problems 


5.1 Introduction 


Now we shall demonstrate how the inequalities that were derived 
in the preceding chapter can be used to treat an important and fasci- 
nating set of problems. These are maximization and minimization 
problems. 

In the study of algebra and trigonometry, the problems you encoun- 
tered were all of the following general nature: Given certain initial 
data, and certain operations that were to be performed, you were to 
determine the outcome. Or, conversely, given the operations that were 
performed and the outcome, you were to determine what the initial 
data must have been. 

For example, you met three workmen, eager and industrious A, 
plodding and conscientious B, and downright lazy C. They were put 
to work digging ditches, constructing swimming pools, or building 
houses, and the problem was always that of determining how long it 
would take the three of them together to perform the job, given their 
individual efficiency ratings—or, given the time required for all three 
together, and the rates of effort of two of the three workers, it was 
required to find the rate of the third. 

Sometimes you were given triangles in which two sides and an 
included angle were known, and sometimes you were given three sides 
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of the triangle. In each case, the problem was that of determining the 
remaining sides and angles. 

The foregoing kinds of problems are simple versions of what may 
be called descriptive problems. 

Here, we wish to consider quite different types of problems. We 
shall consider situations in which there are many ways of proceeding, 
and for which the problem is that of determining optimal choices. 
Questions of this nature occur in all branches of science and consti- 
tute one of the most important applications of mathematical analysis. 
Furthermore, much of physics and engineering is dominated by prin- 
ciples asserting that physical processes occur in nature as if some 
quantity, such as time or energy, were being minimized or maximized. 

Many processes of this sort can be treated in routine fashion by 
that magic technique of Fermat, Leibniz, and Newton, the calculus. 
Supposedly, in the 17th century, men who knew calculus were pointed 
out in the street as possessing this extraordinary knowledge. Today 
it is a subject that can be taught in high school and college, and 
requires no exceptional aptitude. 

When you take a course in calculus, you will find that many of the 
problems treated here by algebraic processes can also be solved quite 
readily by means of calculus. It will then be amusing to solve them 
mentally by means of the techniques presented here, but to follow 
the formalism of calculus in order to check your answer. 

Each technique, calculus as well as the theory of inequalities, has 
its own advantages and disadvantages as far as applications to maxi- 
mization and minimization problems are concerned. It is rather 
characteristic of mathematics that there should be a great overlapping 
of techniques in the solution of any particular problem. Usually, a 
problem that can be solved in one way can also be solved in a num- 
ber of other ways. 


5.2 The Problem of Dido 


According to legend, the city of Carthage was founded by Dido, a 
princess from the land of Tyre. Seeking land for this new settlement, 
she obtained a grudging concession from the local natives to occupy 
as much land as could be encompassed by a cowhide. Realizing that, 
taken literally, this would result in a certain amount of overcrowding, 
she very ingeniously cut the skin into thin strips and then strung them 
out so as to surround a much larger territory than the one-cowhide 
area that the natives had intended. 


MAXIMIZATION AND MINIMIZATION 81 


The mathematical problem that she faced was that of determining 
the closed curve of fixed perimeter that would surround the greatest 
area; see Fig. 5.1. 

In its general form, the problem is much too complicated for us to 
handle. As a matter of fact, at our present level we don’t even know 
how to formulate it in precise terms, since we have no way of express- 
ing what we mean either by the perimeter of an arbitrary curve or 
by the area enclosed by this curve. Calculus suggests definitions and 
provides analytic expressions for these quantities, and a still more 
advanced part of mathematics called the calculus of variations pro- 
vides a solution to the problem. 

As one might guess, the optimal curve in Dido’s problem is a 
circle. Let us here, however, concentrate on certain simple versions 
of the problem that we can’‘easily handle by means of the fundamen- 
tal results we have already obtained in the theory of inequalities. If 
you would like to pursue the study still further, you should read the 
interesting tract Geometric Inequalities by N. D. Kazarinoff, also in 
this series. 


5.3 A Simplified Version of Dido’s Problem 


Let us suppose, for various practical considerations, that Dido was 
constrained to select a rectangular plot of land, as indicated in Fig. 5.2. 


x 


PERIMETER L 
AREA A y 
x 
Figure 5.1. Figure 5.2. 
The problem of Dido Simplified version of the problem of Dido 


With the lengths of the sides of the rectangle designated by x and 
y, respectively, we see that the length of its perimeter is given by the 
algebraic expression 


(5.1) L=2x + 2y, 
and its area is represented by the formula 
(5.2) A=xy. 
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Since x and y represent lengths, they necessarily are nonnegative 
quantities. The fact that the perimeter 2x + 2y is equal to L implies 
that x and y must satisfy the inequalities 


(5.3) £>x>0, S>y>d0. 


It is clear, then, that the area A = xy cannot be arbitrarily large. 
Indeed, from the inequalities (5.3) and the expression (5.2) for the 
area, we see by Theorem 2.5 of Chapter 2 that A cannot exceed the 
value L?/4; that is, we have 
(5.4) 4 > xy =A. 

How are we going to determine the dimensions that yield the 
maximum area? 

Referring to the inequality connecting the arithmetic mean and 
geometric mean of two quantities, we observe that 


(5.5) ~32> Vy 


for all nonnegative numbers x and y. Since, in this case, we have 
x+y=L/2, the inequality (5.5) yields the relationship 


(5.6) 4> V9 or Spy. 


Consequently, we see that our first rough bound for the area, L?/4, 
obtained in (5.4), can be considerably “tightened.” We can, however, 
go much further. Recall that we have established that the equality in 
(5.6) holds if and only if x = y. In our case, this means that the 
new upper bound, L2/16, is attained if and only if x = y. For all 
other nonnegative choices of x and y satisfying the relationship (5.1), 
the area is less than L?/16. 

What does the foregoing analysis tell us? It tells us that in no case 
can the area of a rectangle with perimeter L exceed the quantity 
L2/16, and that this maximum value is actually reached if and only 
if we choose the sides equal to each other, and therefore equal to the 
quantity L/4. 

Thus, by purely algebraic means we have achieved a proof of the 
well-known and rather intuitively obvious fact that the rectangle of 
greatest area for a given perimeter is a square. 
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5.4 The Reverse Problem 


Let us now consider the problem of determining the rectangle of 
minimum perimeter that encloses a fixed area. This is a dual or 
reverse problem to the original. 

Returning to the equations (5.1) and (5.2), we see that we now wish 
to determine the nonnegative values of x and y that make the expres- 
sion 2x + 2y a minimum, while preserving a fixed value A for the 
product xy. 

As might be expected, the same inequality between the arithmetic 
and geometric means yields the solution to this problem. From the 
relationship (5.5) above, we know that 


BAY sy aA. 


Hence we see that 


x+y>2VA, 
and that 
L=x+2>4VA. 


Consequently, we can assert that the perimeter must be at least as 
great as 4\/A and, furthermore, that this minimum value is actually 
attained if and only if x = y = \/A. Hence, once again, the optimal 
rectangular shape must be that of a square. 

This reciprocal relationship between the solutions of the two prob- 
lems is no accident. Usually, in the study of variational problems of 
this type, the solution of one problem automatically yields the solu- 
tion of the dual problem as well. For a proof of this duality principle, 
see, e.g., N. D. Kazarinoff’s Geometric Inequalities mentioned earlier. 


5.5 The Path of a Ray of Light 


Suppose we wish to determine the path ofa ray of light going from 
a point P to a point Q by way of reflection in a plane surface, as shown 
in Fig. 5.3. Actually, the problem as here stated is a three-dimensional 
one, but an extension of the following analysis shows that the ray 
must travel in the plane through P and Q and perpendicular to the 
reflecting plane. 

We assume that the medium is homogeneous, so that a ray of light 
travels at constant speed. How shall we determine the point R and 
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the paths PR and RQ? Let us invoke Fermat's principle, which states 
that the total length of time required must be a minimum for all pos- 
sible choices of the point R. Again, because the medium is homoge- 
neous, this means that the paths PR and RQ are straight lines and 
that R is so located that the length PR + RQ is a minimum. 

Let the coordinates of P, Q be (0, a), (q, b) respectively, and let r 
be the unknown distance OR. Then 


OP =a, TQ = b, OR=r, RT=q-74, 


PR= Va +2, RQ= VP +(q—-n?; 
see Fig. 5.4. Our problem is to determine r so that PR + RQ isa 
minimum, i.e., so that 


VEE + VEE G= TE 


and 


is a minimum. 


R 
Figure 5.3. Path of a reflected ray of light Figure 5.4. Determination of the point R 


We apply the triangle inequality as follows: 


Va +rF+ /P+q-r?> Vat b+ +9q—7) 
= (a + 2 + id 4 
Thus the distance traveled cannot be less than the fixed amount 


V(a + 6)? + g?; and this minimum value is achieved precisely if 
the sets (a, r) and (b, q — r) are proportional with a positive constant 
of proportionality, that is, if 


(5.7) a@__b xo 


rr” q-r 


Observe what the condition (5.7) means geometrically. It means that 
the right triangles ORP and TRQ are similar and that, since b is posi- 
tive, q —r is positive so that R falls between O and T. From the 
similarity of the right triangles, we conclude that the angles ORP and 
TRQ are equal. Therefore, their complements <— SRP and I SRQ 
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are also equal. Thus Fermat’s principle has enabled us to deduce the 
famous result that the angle of incidence is equal to the angle of 
reflection, 


and the more obvious fact that R lies between O and T. 

The above principle of reflection can also be proved by purely 
geometric considerations. If R’ in Fig. 5.5 denotes any point on the 
x-axis different from R, and if Q’ is the point (g, —5), then 


PR+RQ=PR+ RQ’ = PO’ < PR’ + RQ’ =PR + RQ, 
so that the point R is again seen to minimize the distance PR + RQ. 


° 


Figure 5.5. Geometric determination of the point R 


Suppose a plane divides two homogeneous media M, and M2 of 
different density so that in M, light rays travel with velocity vy 
along straight lines and in M2 they travel with velocity ve along 
straight lines. We now wish to determine the least time-consuming 
path from a point P in M, to a point Q in Mp; see Fig. 5.6. Again 
we have 


PR=V@4F, RQ=VPFG-7F 


Figure 5.6. A refracted ray of light 
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and wish to minimize the time (distance/velocity) 


_ Va +r 2 VF + G— ne 
vo 


| niet eee ool 
at 

While this is an easy problem in differential calculus, the minimum 
value does not seem to be obtainable directly from our elementary 
inequalities. The minimum value of ¢ is obtained if the path from P 
to Q is a broken straight line PRQ (see Fig. 5.6) located so that the 
angles 0, and 62 (between PR and the normal to the plane and RQ 

and the normal to the plane, respectively) satisfy the relation 


This relation is known as Snell’s law of refraction. 


5.6 Simplified Three-dimensional Version of Dido’s Problem 


Consider now the problem of determining the rectangular box that 
encloses the greatest volume for a fixed surface area; see Fig. 5.7. 
Designating the lengths of the sides by x, y, and z, we see that the 
volume is given by the expression 

V= xyz, 
while the surface area is represented by the formula 
A = 2xy + 2xz + dyz. 


x 


y 
Figure 5.7. Three-dimensional version of the simplified problem of Dido 


Given the value of A, we wish to choose the values of x, y, and z 
that make V as large as possible. Once again, an application of the 
arithmetic-mean-geometric-mean inequality will resolve the problem 


for us. 
Considering xy, xz, and yz as three nonnegative quantities, we 
obtain the inequality 


(5.8) TPE > feo erz p29 = (xyz. 
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We know that equality can occur if and only if 


xy = xz = yz, 


a relationship that holds if and only if 


x= =2Z. 
Since 4 
xtxz+yz= 4, 
the inequality (5.8) yields 
4 > dyzP3 = V2s8 or (4 >V. 


It follows that the volume of the rectangular box with surface area 
A is less than or equal to (4/6)?/2, and that this value is attained if 


and only if 
Be ee ek Vee 
s=ys2 ( A) é 


Hence, the rectangular box of maximum volume for a given surface 
area is a cube; and, dually, the rectangular box of minimum surface 
area that encloses a given volume is a cube. Once again, we observe 
a reciprocal relationship between two problems. 


Exercises 


I. Show that, if the sum of the lengths of the twelve edges of a rectangular 
box is a fixed value £, then the surface area A of the box is at most 
£2/24, and that the box is a cube if and only if £2/24= A. 

2. State and prove the reverse of the result expressed in Exercise 1. 

3. Of all rectangles having the same length of diagonal, determine which has 
the greatest perimeter and determine which has the greatest area, (Use 
the results in Exercise 3 on page 24 or Exercise 3 on page 62.) 


5.7 Triangles of Maximum Area for a Fixed Perimeter 


Let us now consider the problem of determining the triangle of 
maximum area for a given perimeter. In Fig. 5.8, let s denote half 
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Y z xX 
Figure 5.8. The problem of Dido for a triangle 


the length of the perimeter of the triangle shown; that is, let 


s=A tht? 


As is well known, the area A of the triangle can be expressed by the 
formula 


A =[s(s — xs — ys — 2)". 
We wish to find the maximum value of the area as x, y, and z vary 
over all positive values for which the relationship 
2s=x+y+z 


holds, where the value s is fixed. 

Once again, the arithmetic-mean-geometric-mean inequality yields 
the solution in a very simple way. For the three nonnegative values 
s—x, s—y, and s—z, we have 


[(s — xXs — ys — Z)'3 < (s— x) + Eo ) + (s — 2) 
<3s~+y +2) _35—25 _s 
7 3 catia: ee: 


Hence, 


(5.9) (s = x5 — Wis — 2) < (3) 


3 
From (5.9), by easy steps, we obtain the result 


A = [s(s — xs — ys — 2)}? < (3) | = eae = we 


There is equality if and only if 
S-X=S—y=5s-2, 


that is, if and only if x = y = z. Consequently, we can assert: 
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THEOREM 5.1. Of ail triangles with a fixed perimeter, the equilateral 


triangle maximizes the area. 


Observe that all the results we have so far obtained in this chapter 


seem to indicate that symmetry and optimal behavior are closely in- 
terrelated. Perhaps the profound and aesthetic observation of Keats, 
“Beauty is truth, truth beauty,” sums this up best. 


— 


N 


w 


A 


Exercises 


. Let s denote half the length of the perimeter and A the area of the triangle 


having sides of length x, y, and z, so that 
s= cokss 
and 
A2 = s(s — x\(s — ys — z). 
For the special case of an isosceles triangle, with y = z, let the area be 


denoted by J; and for the special case of an equilateral triangle, with 
x =y =z, let the area be denoted by E. Show that 


P= zx(s - x) 
and 
= st 
eae 
. With the notation of Exercise 1, show that, for triangles of equal perime- 
ter s, 
pes _ 3x\? 
B-P= a( + 31) ( ) 
and 


P — A? = i(s — xy — 2), 


where x is the length of the base of the isosceles triangle and also the length 


- of one side of the general triangle. 
. Using the formulas in Exercise 2, show that 


E2—12>0 
I2?—A?>0. 
Under what circumstances do the signs of equality hold? 


and 


. Using one of the inequalities in Exercise 3, show that, if the perimeter and 


one side of a triangle are given, then the area is maximized by making the 
triangle isosceles. 


. Using one of the inequalities in Exercise 3, show that of all isosceles tri- 


angles with a given perimeter, the equilateral triangle maximizes the area. 
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6. Using the inequalities in Exercise 3, explain how the formula 


A= VET (E?-T)— (A) 


exhibits A in terms of nonnegative expressions, and how the results stated 
in Exercise 4, Exercise 5, and Theorem 5.1 follow from this formula. 

7. Of all right triangles having the same length of hypotenuse, determine 
which has the greatest altitude to the hypotenuse. 


5.8 The Wealthy Football Player 


Consider now a problem of a similar but slightly more complex 
type. 

A football player, having parlayed his athletic prowess on the 
gridiron into a position in Wall Street and thence by natural stages 
into the possession of a variety of stocks and bonds of his own, be- 
came, in the course of time, quite wealthy. Being of a sentimental 
turn of mind, as retired football players are wont, he stipulated in 
his last will and testament that he be buried in a vault shaped like a 
giant football. His executors, honoring his last request, were faced 
with the problem of carrying out this assignment in the most eco- 
nomical fashion. 

After some thought, they decided that the mathematical problem 
most closely approximating the physical situation that they faced was 
that of enclosing a rectangular box of given dimensions within an ellips- 
oid of the least possible volume. 

Following the leads contained in the foregoing problems, they 
began with the reverse problem, i.e., that of inscribing a rectangular 
box of greatest volume in a given ellipsoid. 

According to solid analytic geometry, an ellipsoid with center at 
the origin and axes along the coordinate axes has an equation of the 
form 


(5.10) ate ae 


where 2a, 2b, 2c represent the lengths of the axes of the ellipsoid. Now, 
it is intuitively clear that the inscribed rectangular box will have its 
center at the origin and its sides parallel to the axes. Thus, if one ver- 
tex of the box is at the point (x, y, z) on the surface of the ellipsoid 
(see Fig. 5.9 which shows one eighth of the ellipsoid), then the other 
seven vertices must be located symmetrically. This means that the 
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Figure 5.9. One octant of a box in an ellipsoid 


coordinates of the other seven vertices of the box are given by 
(—x,), 2), @ —y); 2), (x), —z), (—-x, —y)y; 2); 
(% —-y,-2), (—%Y,-2), (—% —y, -2). 
Since the sides of the box have lengths 2x, 2y, 2z, it follows that 
the volume of the box is given by the expression 
(5.11) V = 8xyz. 


The problem that confronts us is that of maximizing the expression 
(5.11) subject to the condition that the quantities x, y, and z satisfy 
eq. (5.10). 

Once again, this problem is easily resolved by use of the arithmetic- 
mean-geometric-mean inequality. We have 


} x2 y? ze x2 Fis ae 1/3 
on) ie tert Ze (3 ro zy 


Using the fact that x, y, and z satisfy eq. (5.10), and using the formula 
(5.11) for V, we see that the inequality (5.12) yields 


1 V2/3 
3 = 4(a2b2c2)1/3 
or 


(abe) > v2, 


It follows that the volume of the box is at most 8abc/3,/3; further- 
more, by the condition for equality in (5.12), this value is attained if 
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and only if 
x? peat fi eae | 


a pb 3 


or 


a b c 
(5.13) ery i ay 3, ae 

These values (5.13) furnish the desired dimensions of the box, 
given an ellipsoid with semiaxes a, b, and c. 

Now let us return to the original problem. We have seen that for 
given values a, 5, c, the values (5.13) for x, y, z give the lengths of 
the half sides of a rectangular box of maximum volume. But in the 
other direction, is it true that for given values x, y, z, the values 


(5.14) a= V3x, b=V3y, c= V3z 


give the minimum volume for a containing ellipsoid? Whether the 
executors were perspicacious or just lucky, they were right in follow- 
ing their mathematical hunch. Here is a proof of that fact: 

The volume W of the ellipsoid (5.10) is given by the formula 


W= fn abe, 


and for given positive x, y, z we want to choose positive a, b, c satis- 
fying eq. (5.10) in such a way as to minimize W. 

In order to see the reciprocal relationship between the two prob- 
lems, it is convenient to consider reciprocals; and since it is. now 
x, y, Z that are given and a, b, c to be determined, we also reverse 
ends of the alphabet and set 


aly. pak oe oe 
(5.15) ¢=y, b= 5 eee Ge ce 


Eq. (5.10) then becomes 


(VAP. (BY, ACY 
(yxy * (iret (a7zp! 


or 
(5.10’) Stor taal: 


and, subject to this constraint, we wish to choose X, Y, Z so as to 
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minimize 


W =4nabe. 
3 
Since the coefficient 47/3 is a constant, minimizing W is equivalent 
to minimizing 


4] 
abe = Yyz 


which, in turn, is equivalent to maximizing 
(5.11’) V’ = 8XYZ. 


But this is just the problem we have already solved, namely, the 
problem of determining the box of maximum volume that can be in- 
scribed in a given ellipsoid. It does not matter that this ellipsoid 
(5.10’) and box of volume (5.11’) have no physical existence relative 
to our football player; indeed, this fact serves to emphasize and 
dramatize the importance of pure mathematical analysis. The solu- 
tion values {cf. eq. (5.13)] are 


(5.13") x=, y=y z= 

Substituting from (5.15) into (5.13’), we obtain (5.13) and (5.14). 
Considering, in a typical case, a length 2x of 6 feet, a width 2y of 

2 feet, and a height 22 of 1 foot, we obtain the values 


a=3¥V3, b= V3, c= 3/2, 


for an ellipsoid of minimum volume that would contain the box. 

Because of the curvature of the elliptical vault, there would be 
ample room for the inclusion of a few well-worn footballs to accom- 
pany our gridiron hero to his place of rest. 


5.9 Tangents 


Let us now apply the theory of inequalities to the problem of find- 
ing a tangent to a given curve. Of course, we can do this only in an 
intuitive fashion at this time, since any precise definition of what we 
mean by a tangent to a curve at a point lies outside our chosen 
domain of discourse. 
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Consider a curve determined by an equation of the form y = f(x), 
as shown in Fig. 5.10, and a line, with equation 


(5.16) mx+ny=k, 


that cuts the curve in the points P and Q. Let us move the line parallel 
to itself [this merely involves changing the value of k in the equation 
(5.16) of our line} until these two points become coincident at R, as 
shown in Fig. 5.11. The line mx + ny = k’ may be called the tan- 
gent line to the curve y = f(x) at the point 2. Let us again point 
out that we are proceeding in an intuitive fashion without attempt- 
ing to make the notion of a tangent precise. But you are familiar at 
least with tangents to circles and can see that the procedure does lead 
to the desired result in this special case. 


mx¢ny =k’ 


Q 


ie) 
Figure 5.10. A curve and a cutting line Figure 5.11. A curve and a tangent line 


We shall use the foregoing procedure and the theory of inequalities 
in order to determine the tangent lines in a given direction to an 
ellipse. Suppose the ellipse (see Fig. 5.12) is given by the equation 

x2 2 
grea 


and let (x;, yi) be one of the two points of tangency for lines of the 
form mx + ny =k, where mand mare fixed, m? + n?0, and k 
varies through all values. Observe that this point of tangency can be 
characterized in the following way: 

(a) The point (x;, y1) lies on the ellipse; ie., the values x), yy 
satisfy the equation 

x1? yr ea 

(5.17) ar + ee = L; 

(b) The point (x1, y1) lies on the line; i.e., the values x,, y; satisfy 
the equation 


mxytni=k. 
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R: {.y) 


Figure $.12. An ellipse and two parallel tangent lines 


(c) The distance from the origin to the line mx +ny=k isa 
maximum for all points (x1, y1) satisfying conditions (a) and (b), as 
K varies. 


The distance d from the origin to the line mx + ny =k is given 
by the formula 


= \K|_ | mx, + ny, | 
6.18) = eae 
To see this, observe that the line 
(5.19) mx+ny=k 


has slope —m/n (see Fig. 5.13) and that a perpendicular line OP 
through the origin must therefore have the equation 


(5.20) yx. 


Solving the system of linear equations (5.19) and (5.20), we find the 


y 


Figure 5.13. The distance formula 
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coordinates of P to be 


km kn 
x= = | 
m+n? ~~ mam 


The distance from the origin to P is therefore 


£3 M k2m2 k2n2 1/2 
as OF= ke: + n?2)2 " (m2 + od 


_{_ 1k] 


1°2 
(ata ma 3) 7 


and, since k = mx1 + ny; by condition (b), formula (5.18) is proved. 
It follows that the problem of determining the tangent line is that 
of maximizing the expression (5.18) over all x; and y1, subject to the 
constraint (5.17). 
An application of Cauchy’s inequality, i.e., the inequality (4.38) in 
Sec. 4.4(a) of Chapter 4, yields 


_ |mxy + nyil _ |(am)(x1/a) + (bn y1/b)| 
a a 


‘azm2 + ben2 1/2 x4? yy? 1:2 
oe < (Rt) (E+E 


_ farm? + b2n2\2 
= ( m2 + n2 ) : 
The points of tangency are determined by two conditions. Namely, 
they must lie on the ellipse, so that (x1, 1) satisfies (5.17); and the 


distance (5.18) must attain its maximum value, so that the equality 
sign holds in (5.21), which is the case if and only if 


X1/a 2 yi/b 


(622) am bn 


The solution of the system of linear equations (5.17) and (5.22) yields 
the values 


= atm -_ b2n 
(5.23) xy= = Gm? + Bnd’ yi= = G@mt + Prd?’ 


where either both plus signs or both minus signs are taken, and the 
desired values k’ and k” (see Fig. 5.12) of the constant & can then be 
found by substitution in eq. (5.19). 
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5.10 Tangents (Concluded) 


With a slight amount of additional effort, we can obtain a much 
more elegant result, resolving the problem of finding the tangent to 
an ellipse at a given point on the ellipse, rather than the tangent in a 
given direction. 

In place of solving for x; and y; in terms of m and n, we need only to 
solve for m and n in terms of x, and y;. It is that simple! From 
eq. (5.22), we see that we must have 


rx: 
= n= 21 
a b2 


where r is some as-yet-undetermined constant of proportionality. 
Using these values in the equation mx +ny=k, we obtain the 
equation of the tangent line in the form 


(3)-+(a)= 


e@ RB” r 


or 


Since, on the one hand, (x, 1) is a point on this tangent line, and 
since, on the other hand, it is also on the ellipse—that is, (x1, y1) sat- 
isfies equation (5.17)—we must have k/r = 1. Hence, we obtain the 
very simple and elegant result that the tangent to the ellipse 


ee Re 
mt Rp | 
at the point (x;, y1) on the ellipse has the form 


coma? ere A A Nee 
a tp ol. 


As you will see when you study calculus, this is the same result that 
one obtains through the use of derivatives. 


Exercises 
1. Determine which of the points (5, —3), (3, 5), and (7, 0) is on the ellipse 


2 = 
30 tig =!- 
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. For x = 2, determine the values y such that the point (x, y) lies on the 


ellipse in Exercise 1. Also for x = 2, determine the values of y such that 
the point (x, y) lies either inside or on the ellipse. 


. Determine the equation of the tangent to the ellipse in Exercise 1, at the 


point (—5, 3) on the ellipse. 


. Solve the system of equations 


x2 2 

ata) 

mx+ny=k, 
where a, b, m, and n are fixed, choosing k so that the resulting quadratic 
equation has a double root. (You will find that k? = a?m? + b?n?.) 
Compare your answer with the values of x, yi given at the end of Sec. 
5.9, page 96. 


CHAPTER SIX 


Properties of Distance 


6.1 Euclidean Distance 


The distance with which you are familiar, between two points 
P:(x;,y1) and Q:(x2,y2) in the (x, y)-plane, is called Euclidean dis- 
tance. We shall denote it by d2(PQ); it is given by the formula 


(6.1) d(PQ) = [(x2 — x1)? + (v2 — yn)?]?. 


We shall now enumerate some of the properties that characterize 
this distance function. 

1. The distance between two points depends only on the position 
of one relative to the other; that is, it depends only on the differences 
X2—X, and ye—y. of their coordinates. This property (that the 
distance between two points does not change when both points are 
shifted by an equal amount in the same direction) is called translation 
invariance. 

2. The distance from a point P to a point Q is equal to the distance 
from Q to P. This is seen by verifying, in (6.1), that 


a_(PQ) = d{QP). 
Property 2 is usually called the symmetry of the distance function. 


9 
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3. The triangle inequality 


d(PR) < do(PQ) + d{QR) 


is satisfied by the distance function (6.1); see Sec. 4.6. 
4. The distance d.(PQ) between any two points is nonnegative 
for all P and Q. That is, 


a(PQ) > 0; 


the sign of equality holds if and only if the points P and Q coincide. 
This is often called the positivity of the distance function; it follows 
immediately from the definition (6.1). 

5. If P has coordinates (x, y) and Q has coordinates (ax, ay), where 
a is a nonnegative constant, then 


4(OQ) = ad,OP); 


here O denotes the origin (0, 0). This property is sometimes called 
the homogeneity of the distance function, and it holds because 


do(OQ) = [(ax)? + (ay)?]/? = [a2(x? + y?)p/? 
= a(x? + y?)!/2 = ad(OP). 


The Euclidean distance has still another property: 

6. The Euclidean distance between two points remains unchanged 
if the (x, y)-plane is rotated about the origin through some angle. 
This property is sometimes called rotation invariance. 


6.2 City-Block Distance 


It turns out that many other useful and interesting “non-Euclidean” 
distances can be defined. In order to be called a “distance,” a func- 
tion of P and Q must have the properties 1 through 5 that we just 
verified for the familiar distance (6.1). The Euclidean distance d2 
alone has all six properties. 

As an example, let us invent a “‘city-block distance” by determining 
the actual length of a path between two addresses P:(x1,y1) and 
Q:(x2,y2) in your home town, assuming that all the streets are laid 
out strictly north-south and east-west, and that there are no empty 
lots to cut across; see Fig. 6.1. Any path from P to Q is made up ex- 
clusively of horizontal and vertical pieces, so that the distance d(PQ) 
we would have to traverse consists of the sum of all the horizontal 
distances and all the vertical distances making up the path from P to 
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STREET 


AVENUES 


Figure 6.1. City-block distance 


Q. We shall therefore define city-block distance d, by 
(6.2) d,(PQ) = |x1 — x2|+|y1 — yal, 


although, strictly speaking, this does not quite accurately describe 
the situation. It is not correct for the case shown in Fig. 6.2, where 
the addresses P and Q lie between the same two north-south (or 
east-west) streets; in this case, the traveler is forced to reverse direc- 
tions in the course of his walk. Nevertheless, let us take (6.2) as the 
definition of a new “non-Euclidean” distance. After all, our city-block 
example just served as a motivation. (If our blocks are very small, 
(6.2) is fairly accurate. More precisely, our new distance function is 
the minimum distance required in traveling from P to Q when con- 
strained to move only in the four principal compass directions.) 


Figure 6.2. City-block distance, exceptional case 


Next, let us see if d;, as defined by (6.2), has the five properties 
required of a distance. 

Since only the difference of the coordinates enters the expression 
(6.2), our new distance is certainly translation invariant, so that it has 
property 1. 

Since d;(PQ) = di(QP), d, is symmetric, and thus it has prop- 
erty 2. ; 
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To establish the triangle inequality 
d,(PR) < 4,(PQ) + d,(QR), 
let P, Q, R have coordinates (x}, v1), (x2, y2), and (x3, y3), and write 
dy(PR) = [x1 — x3| + [1 — ys| 
= |[X1 — X2 + X2 — x3| +1 y1 — ye + yo — ya; 
since, by Theorem 3.2, page 42, 
|X1 — Xg + X2 ~ X3| < [x1 — Xo] + [x2 — x3I, 
yi — v2 + 2 — ¥3| <|y1 — vel + bye — ysl, 
we have 
4,(PR) < | x1 — x2| +11 — yel + |x2 — x3| + lye — ys} 
= a,(PQ) + a,(QR). 


The city-block distance certainly satisfies our fourth condition 
since the absolute value of any real number is always nonnegative. 
It is positive unless P and Q coincide. 

Property 5 is easily verified since, for a > 0, 


|ax| + lay] = af|x|+lyiJ. 


Next, let us try to generalize the notion of a circle from Euclidean 
to city-block geometry. In Euclidean geometry, a circle is the locus 
of points equidistant from a fixed point. Let us carry this definition 
over to our new geometry. According to (6.2), the “unit circle” with 
center at the origin 0:(0,0) would be given by the equation 


a(OP) =|x|+|y|=1. 


Its graph in the ordinary Euclidean plane is shown in Fig. 6.3 (see 
also Fig. 3.14). 
y 


LN. 
NZ 


Figure 6.3. The unit circle in city-block geometry 
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6.3 Some Other Non-Euclidean Distances 


Suppose that we now define the “distance” between the origin O 
and an arbitrary point P to be given by the expression 


(6.3) d,(OP) = (|x|? + Ly |p)ve 


for some fixed value of p > 1, and generally, that we define the dis- 
tance between any two points P:(x1,y1) and Q:(xe2,y2) to be 
given by 


(6.4) 4(PQ) = {x1 — x2|? +|y1 — yel?]’?. 


Again, we check the five distance properties. The distance (6.4) 
certainly is translation invariant. Moreover, it is symmetric; that is, 
d,(PQ) = d,(QP). Thirdly, the triangle inequality follows from for- 
mula (4.53’), which was proved at the end of Sec. 4.8 from the Min- 
kowski inequality. Property 4 on positivity is also satisfied by (6.4); 
and finally, for P:(x,y) and Q:(ax, ay), we have 


4,(0Q) = [lax|? + |ay|?}*/? = (a )7[| x]? + |y |}? =ad(OP), 
so that our distance d, also possesses the fifth property. 
In this case the “unit circle,” i.e., the locus of points at a distance 
1 from the origin, is given by 
Ix}? +|yl? = 1. 


Just what such locus curves look like depends on the particular value 
of p. For example, when p = 1, we are back at the city-block unit 


Figure 6.4. Euclidean graphs of non-Euclidean “unit circles” 
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circle; for p= 2, we have the usual Euclidean unit circle. For 
p = 1,2,4, we have the inequalities 


Ix[+ [yl > (xl? + yl?? > (x14 + Lyles, 


which are easily verified by squaring. The Euclidean graphs of the 
corresponding “unit circles” are shown in Fig. 6.4; the curve for 
p = 1 is contained in that for p = 4. Is it true, in general, that the 
“unit circles” for distances defined by (6.3) are situated in such a way 
that the one for any fixed p contains those corresponding to smaller 
p? If so, do these curves become larger and larger as p increases? 

To answer the first question, we first formulate an equivalent 
question: Does the inequality 


Lx Lyi" > Lxle eye 


hold whenever m > n> 1? It does indeed. Let us give a proof for 
m=3, n=2, and leave the general case for the reader as an 
exercise. 

In order to avoid writing so many absolute-value signs, let us 
introduce 


a=|x|, b=jpI. 
We must then prove 
(a? + b2)1/2 > (a3 + 53/3 fora,b>0. 
We write 
a + b3 = aa® + bb? 


and apply Cauchy’s inequality (in square-root form), obtaining 


(6.5) (a2 + b2)1/2(a4 + b4)!/2 > a3 + BS. 
2 a? + b2 > (at + bA)12, 
(6.5) yields (a2 + B)/2(a2 + b2) > a8 +B 
“ (a2 + b?)3/2 > a3 + B3, 
whence 


(a2 + b2)/2 > (a3 + b3)1/3, 


The proof for arbitrary rational m > n> 1 can be obtained by a 
corresponding application of Hélder’s inequality. It is not difficult to 
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see from the above discussion that, as p approaches 1, the “unit 
circles” 

jxjp+iylP=1l, p>, 


approach the square |x|+|y|= 1 shown in Fig. 6.3. 
In order to see what happens as p gets larger and larger, we observe 
that 


(6.6) max {|x|*%|y|?} <|xl? + yl? < 2 max {|x]*,|y|?}. 


Since 
[max {|x|?,|y|?}}/? = max {|x|,|y1}, 
(6.6) yields 
(6.7) max {|x|,|y1} < [lx]? + ly|#}”? < 21 max {|x|,|y]}- 


Now consider what happens to the right-hand member of (6.7) as p 
becomes very large; p occurs only in the exponent 1/p of 2. As p be- 
comes very large, 1/p becomes very small and 21/? therefore ap- 
proaches 29= 1. In other words, (6.7) tells us that 


d,(OP) = [|x]? + |y|?]}*/? 
approaches the distance 
(68) d(OP) = max {|x|,1y]} 


as p “approaches infinity.” It can be shown that d,, has all five 
distance properties. 
Now, what does the “unit circle” 
max {|x],|y]} = 1 
look like? It is given by the square with sides 
Ixj=1, O<|yI<1 
vee i=l, OS[x<1. 


So we see that 


4,(PQ) = [|x1 — x2]? + lyr — yo] 9}? 


can be used as a distance function for all p > 1 and that, as p grows 
larger, the “unit circles” 


[xl? +|yl? = 1 
grow larger, approaching the square (6.9) as p approaches infinity; 
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in spite of the fact that these “unit circles” grow bigger and bigger, 
they never get out of the square (see Fig. 6.4). 

The “unit circle,” in all these cases, divides the (x, y)-plane into two 
regions: the interior of the “unit circle,” comprised of all points at dis- 
tance less than | from the origin, and the exterior, comprised of all 
points at distance greater than | from the origin. The set of points 
defined by the inequality 


aOP) <1 
is sometimes called the unit disc, and the “unit circle” 
aOP) = 1 


is called the boundary of the unit disc. 

Some general remarks are in order. The Euclidean distance, as 
pointed out earlier, is invariant under translations (property 1) and 
rotations (property 6), i.e., under the so-called displacements or 
rigid-body motions. The other distances treated above also do not 
change under translation, but they do change under rotation. In fact, 
we can see from Fig. 6.4 that the city-block distance d; goes into the 
distance d,, = max {|x2 — xi|,|y2 —y1|} (stretched by a factor 
2) when the (x, y)-plane is rotated through 45° about the point 
(x1, yi). It can be shown (but we shall not do so here) that the 
Euclidean distance is completely characterized by the six properties 
enumerated in Sec. 6.1; this means that the only distance that has, in 
addition to the five required properties, the property of rotation 
invariance is the Euclidean distance. 


6.4 Unit Discs 


There are many other functions having the five properties required 
of a distance; we have considered only a few. 

We might ask the following question: Given a set S of points 
(which we shall henceforth call a point set) in the (x, y)-plane, such that 
S contains the origin in its interior, under what conditions does it 
represent the unit disc belonging to some distance d? In other words, 
under what conditions does there exist a distance function d for which 
S contains exactly those points P that are characterized by the 
inequality 


aOP)< 1? 
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We claim that, for a given point set S having the origin in its interior 
there exists a distance function d such that S plus its boundary is the 
unit disc for d if and only if the following conditions are satisfied: 

(a) The point set S is symmetric with respect to the origin. 

(b) The point set S is convex. 

A point set S is symmetric with respect to the origin if, for each 
point (x, y) belonging to S, the point (—x, —y) also belongs to S. It is 
convex if, for each pair of points in S, the line segment joining these 
points is entirely in S. See Fig. 6.5, (a) and (b). 


y y 
x,y) 
AS -e . 
(-x,-y) 
(a) (b) 


Figure 6.5. Point sets in the plane 
(a) Point set symmetric with respect to the origin 
(b) Convex Point set 


We shall first prove that property (a) holds if d is a distance func- 
tion: If d is a distance function and S its unit disc, then S is symmetric 
with respect to the origin. In other words, if d is a distance function 
and P:(x,y) satisfies dOP) < 1, then the point Q:(— x, —y) 
satisfies d(OQ) < 1. 

If dis a distance, then it is translation invariant; hence, if we shift 
the points Q and O by an amount x in the horizontal direction and 
an amount y in the vertical, their distance d(QO) will remain un- 
changed. But such a shift brings the point Q into the origin O and 
the origin O into the point P. Hence 


HQO) = OP) < 1, 
and, since d is also symmetric, 

a(OQ) = QO) <1. 
Accordingly Q lies in the unit disc. 
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Next, let us prove that if d is a distance function then property (b) 
holds by showing that, if P and Q are any two points in the unit disc, 
then the line segment PQ lies entirely in the unit disc. Symbolically, 
the problem is expressed thus: Given any two points P, Q such that 
d(OP) < 1, HOQ) <1 for some distance function d, and any given 
point R on the segment PQ, prove that d(OR) < 1. This is imme- 
diately seen to be true if P = Q or if either P or Q is at the origin, 
sO we may assume that O, P, and Q are distinct points. 


Figure 6.6. Convexity of the unit disc 


The first step in the proof consists in expressing the fact that R lies 
on PQ in a convenient form. Let P’R be parallel to O@ and let Q’R 
be parallel to OP (see Fig. 6.6). Then we claim that 


(6.10) dOP’)=ad(OP), dOQ’)=bdOQ), 


where a>0, 6>0, and a+6=1. To see this, we shall first 
employ the Euclidean theory of proportions (similar triangles). Using 
the same symbol AB for a Euclidean line segment and its length, we 
have 


— OP _ QR — 0Q'_ PR 
(S10) ~ OP” OP’ >= 00 = PO" 


where a and b simply denote these ratios and hence are positive 
numbers. Moreover, by adding these ratios, we obtain 


QR+RP_QP_, 
QP QP 
Since P and P’ lie along the same straight line through O, their 
coordinates are proportional. We may therefore make use of prop- 
erty 5 of the Euclidean distance and deduce, from (6.11), that if P 
has coordinates (x1, y1), then P’ has coordinates (ax, ay;). Similarly, 


a+b= 
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since Q and Q” are on the same line through O, if Q has coordinates 
(2, y2), then Q’ has coordinates (bx2, bye). We now use the fact that 
our distance d also has property 5, so that 


(6.12) adOP’)=adOP), d0Q’)=badOQ). 
Finally, we use property 3 (the triangle inequality; see Fig. 6.6): 
d(OR) < d(OP’) + d(P’R), 
or, since dP’R) = OQ’) by property 1, 
OR) < HOP’) + a0?0’). 
Substituting from (6.12), we have 
d(OR) < ad(OP) + bd(OQ), 
and, since dOP) <1, d0Q) <1, and a+b=1, 
dOR)<a+b6b=1, 


which, by definition, means that R is in the unit disc. 

So far we have shown that, if dis a distance and S its unit disc, 
then S is symmetric with respect to the origin and convex. We must 
still prove the converse: If S is a point set containing the origin in its 
interior, and S is convex and symmetric with respect to the origin, then 
there exists a distance function d for which S is the unit disc. 

We shall indicate how such a distance d may be defined, but we 
shall leave to the reader the job of verifying that d has the five pre- 
scribed distance properties. 


Figure 6.7. (OZ) = oz, PQ) = re. 
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Let S be a point set of the prescribed sort; see Fig. 6.7. Let Z be 
any point other than O in the plane. Draw the ray from O through 
Z and let it intersect the boundary of S in the point Z’. (It follows 
from the convexity of S that there is just one such intersection of the 
ray with the boundary of S.) Then calculate the ratio 


_ OZ 
OZ’ 


of the Euclidean distances OZ to OZ’ and define the distance from 
O to Z to be this ratio; i.e., let 


d(OZ) = r. 


Observe that d(OZ) is less than 1, equal to 1, or greater than | de- 
pending on whether Z is an interior point of S, a boundary point of 
S, or an exterior point of S, respectively. 

To define d(PQ) for any points P and Q, shift the coordinates as 
indicated in Fig. 6.7 and proceed as before. 

In an ordinary Euclidean circle, the ratio of the circumference to 
the diameter is denoted by the symbol a and is approximately 3.14. 
In the exercises below, the task is to find the ratio r of the non- 
Euclidean length of the circumference to that of the diameter of some 
non-Euclidean unit circles. We shall analyze the situation in one par- 
ticular case, below, and leave other simple cases for the reader to 
investigate. 


r 


Figure 6.8. A regular hexagon symmetric with respect 
to the co-ordinate axes 


ExaMPLE. Let S consist of the points of a regular hexagon symmetrically 
situated with respect to the coordinate axes, together with the points of its 
interior; see Fig. 6.8. Since S$ is convex and symmetric with respect to the 
origin, it may be considered the unit disc for some distance function d. Since 
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the non-Euclidean radius of the unit disc is 1 by definition, its diameter is 2. 
To calculate the circumference, we observe that, because of the translation 
invariance of d, the following relations hold: 


AP,P2) = dOP3)=1, aAP2P3)=d(OPy)=1, a(P3Ps) = QOPs) = I, 
d(PsPs) = KOPs)=1, d(PsPe)= dOP:)=1, dPsP:) = HOP2) = 1. 


If we add the lengths of all these segments, we find that the circumference 
is 6. Hence, the desired ratio is 


Exercises 


Calculate the ratio of the non-Euclidean length of the circumference to that 
of the diameter for the following unit discs S: 


— 


. Sis the unit disc for the city-block distance |x2 — x1|+|ye — yal; see 
Fig. 6.3. 


2. S is the unit disc for the distance function 
d(OP) = max {|x|,\y]}. 
. The unit disc S consists of the interior and boundary points of a regular 
octagon with center at the origin. 


4. The unit disc S consists of the interior and boundary points of a regular 
ten-sided polygon with center at the origin. 


oe 


6.5 Algebra and Geometry 


What we have observed in the preceding sections is that geometric 
intuition can be used to derive interesting algebraic results. In two 
or three dimensions, this technique works well. As soon as we turn 
to a discussion of n-dimensional geometry, for n > 4, the situation 
reverses. Now, we often rely on algebra to make geometric definitions 
and to establish geometric results. 

Let us briefly illustrate this idea. Consider a set of n real quantities 
X1, X2,.++,Xn a8 constituting a point P in n-dimensional space. The 
Euclidean distance between the two points P:(x1, X2,...,Xn) and 
Q: (1, Ya, - ++» Yn) is defined to be 


(6.13) HPQ) = [C1 — yr)? + Cre — yo)? ++ + On ~ Yn)? }?. 


For n= 2, (6.13) reduces to the familiar expression for the distance 
between two points (x, x2) and (1, y2) in the plane. If we denote the 
origin (0, 0,...,0) by O and the point (x1 + yi, X2 + yo, .--5Xn +a) 
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by R, then the n-dimensional triangle inequality 


d(OP) + d(PR) > d(OR) 
would read 
[x1? + X_2 f+ eee + Xy?}/2 + [y1? + yo? ose + yr2p/2 
> [Orr + v1)? + (x2 + yo)? tees + tn + n)?P?. 


This is a valid inequality, as was noted on page 70 in Sec. 4.6. 
Next, let us define the cosine of the angle @ between the lines OP 


and O@ by 
(6.15) cos@= 


(6.14) 


Xiyyi + Xgv2 tere t+ Xan 
a? ket teeta Aaya? ies a Ya 
The Cauchy inequality [see Sec. 4.4(d), inequality (4.45)] shows that 


{cos @|< 1. 
We now have the foundations for an analytic geometry of n 


dimensions. 


SYMBOLS 


absolute value of a 
set of all negative numbers 
set of one element, namely, the number 0 
set of all positive numbers 
is a member of 
is not a member of 
equal to 
not equal to 
greater than 
not greater than 
less than 
not less than 
greater than or equal to 
neither greater than nor equal to 
less than or equal to 
neither less than nor equal to 
neither greater than nor less than 
nonnegative square root 
{@1,42,...,a,} set of elements a}, d2,..., Qn 
max {a1,a2,...,dn} greatest element of {aj, ao,...,@n} 
min {@3,@2,...,@n} least element of (1, a2, ..., Qn} 
{@1,@2,...,@,}* max {0, a1, a2,...,@n} 
{@1,@2,...,@n}~ min {0, a, @2,..., Gn} 
sgnx Ofor x=0, lfor x>0, —I for x<O0. 


& 
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Answers to Exercises 


Chapter 1 
Pages 12-13 
-1.5 3-9 4-3 ve 
Ca es Se ce 
3 -2 -l ) ' 2 3 


-3<¢ -2<¢ -15< -l<3—7<0< 7-3 VWI<2<3 


- (abe &)¢ © # Gt Oe ()4# Me ME WF (iF 


3. (a) N (b) P (C)P G)N ()P (ff) P (P(N WP (j)O 


Oo OI HW 


@a< ®%)> ©> @< ©> (> 
@M>W< @O> QO= 


- (a)T (6) T (c)T G)F (€)T €)T @F OT @F OF 


2, 7-3, —3—?2, a/(c—b), 0, — OF —4ac. 


-@2BH2O<S @M>O< M= 


p>0o0, —n>0, so p—n>0, so p>n. 


.a=b 
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10. True for n = 1 by hypothesis; induction step by Axiom II. 


11. Numbers I and 2 were shown in text to be “positive,” so 1 + 2 =3 is 
“positive” by Axiom II. Let a = 1/3; then 3a = 1, soa is “positive” 
since 3 and 1 are “positive.” Then 2a = 2/3 is “positive” by Axiom II. 


Chapter 2 
Page 19 
1. Add a/2 < b/2 first to a/2 = a/2, thento 6/2 = b/2. 
2. Both inequalities are equivalent to (ad — bc)? > 0. 
3. Equivalent to (a — 6)* > 0. 
4 


. For n = 2, the theorem states: If a, > az, then a; > a2, which is 


true. Suppose true for n and let a; > a2, ..., Qn-1 > n, Qn > Ana. 
Then a, > an, @n > Gnai, SO @; > Gny1. Sign of equality if and only 
if a, = a2, ..., Qn-1 = Gn, ANd Gy = Any. 

Pages 21-22 


1. Intermediate step: 
rie ere | 
Vab-a’ Bb’ 
Equality if and only if a = b. 
2. In Exercise 1, let b = 1/a. Equality if and only if a= 1. 
3. Add together a? + 62 > 2ab, 62 +c? > 2be, c? + a2 > 2ca, then 
divide by 2. 
4. Equivalent to a2b2(a — 6b)? > 0. 
5. Multiply a? + 6? > 2ab by c, b? + c? > 2be by a, c? + a® > 2ac by 
b, then add. 


6. Intermediate step: 
(a2 — b2)? — (a — b)4 = 4ab(a — b)?. 


7. Intermediate step: 
(a3 + b3) — (a2b + ab?) = (a + bla — 5)?. 


8 G)a=b=c; (4)a=5 orat least one of them = 0; 
(5)a=b=c; (6)a=b orat least one of them =0; (J)a= <b. 
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Page 24 


1. Equivalent to (a — b)? > 0. Sign of equality if and only if a=5b>0. 


. By Theorem 2.7 and the fact that c and d are of the same sign, ae ~— be 


and at — b¢ are of the same sign. Multiply out. Sign of equality if and 
only if a= 6. 


. a +b? >2ab, a+b>2Vab. 


4. The equivalence of these inequalities can be proved by multiplying the 


Ned 


a |W NO = 


first by bd>0 and the second by 1/bd>0; see Theorem 2.3. The 
equivalence of the corresponding equations is proved in the same way. 


. Add 1 =1 to the inequality. Equality a/b =c/d is equivalent to 


ad = be. 


. Apply Theorem 2.7 with exponent —1, add 1 = 1 to resulting inequality, 


then apply Theorem 2.7 again with exponent —1. Sign of equality as in 
Exercise 5. 


. Intermediate steps: 


atc _a._.be—ad +c be—ad 


—2= cate 0. 


a 

b+d b” Wb+d)~— > d b+d™ db+d)~ 
30 < 33; 22< 25; 18< 21; 42< 45. 
If a<b and b<c, then a<c. 
If a<b and c<d, then a+c<b+d. If a<b andc is any real 
number, then a+ce<b+c. 
If a<b and c>0, then ac<be. If a<b and c<0, then 
ac > be. 
If a<b and c<d, then a—d<b—c. If a<b andc is any real 
number, then a—ce<Cb—c. 
If 0<a<b and O<c<d, then ac< bd. 
If O0<a<b and O<c<d, then a/d< b/c. In particular, for 
a=b=l, if 0<c<d, then Il/d< I/c. 
If 0<a<b, if m and n are positive integers, and if a/ and bin 
denote positive nth roots, then 


qm/n < bmn and b-m/n < qum/n, 


Chapter 3 
Pages 28-29 


- (a) —1 () 7 ©0 (4 © 3 (0 (7 (HO 4 G3 
- @) —7 (&) V2 © —7 @)0 © —3 ) -7 @) 0 h) -7 (0 () -3 
» (~ DO) — (©) = 0. 


. Consider representative cases along with the definition of max { }. 
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5. 
6. 
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{a,b,c,d} = {-—1,-1, -1, -1}. 
One of {a,b}* and {c,d}* is equal to {a,b,c,d}*; the other is 
nonnegative. 


. First and third inequalities: possibly an additional candidate, namely 0. 


Middle inequality: definition of max{ } and min{ }. No: 
41, @2,...,4, would all have to be negative for first strict inequality, and 
all positive for third. 


8. Multiply inequalities a > b, a >c by —1 and apply Theorem 2.3. 
9. {-a, ~b}~ = min {0, —a, —b) = — max {0,a,b} = —{a,b}*. 


10. 


Consider representative cases along with definition of max{ }. 


Pages 33-34 


. For each real number a, —{ja|< —a <|a|. The first sign of equality 
holds if and only if a > 0, and the second if and only if a< 0. 


3. 


(a) is also the graph of (d), (g); (b) is also the graph of (e), (h); (c) is also 
the graph of (f), (i). 


ANSWERS TO EXERCISES 19 


6. Fig. 3.3, odd; Fig. 3.4, odd; Fig. 3.5, even; Fig. 3.6, even. 


120 AN INTRODUCTION TO INEQUALITIES 
Page 36 


Part (d) is easy; there is nothing to do. 
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tt) 


(b) 


(a) x <0, y>0, x+y<l; 
(b) x <0, y>0, x+y2>1; 
(ce) x>0, y>0, x+y 21; 
) x20, y20, x+y <1; 
(e) x< 0, ys 0, x+y <l; 
(ff) x>0, y<0, x+y <l; 
(g) x20, ySO0, x+y21. 
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Incomplete graph. 


Incomplete graph. 
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an Bw WwW NY 


. (a) |-al= V(—ay? 


. The y-axis, x = 0. Geometrically, the problem is to determine the locus 


of points equidistant from (— 1,0) and (1, 0). Analytically, it is to solve 


the equation 
VOF IEF = VEER. 


Page 45 


= Vat = lal 
(b) jab| = Y(abP = Va? VB? = |a|-|5| 


© [sl = JG) =a =P 


. (a) =, (b) >. (c) =, (d) =, (e) >. 
- (a) >. (b) =, () =, dd) =, |=. 
- (a) >, (b) =, (c) =, (d) >. (e) =. 
- (a) =, (bK) >, (>. (dd =. Ce) =. 


V@=OF > Vat — VBP, 
(a — bP > (Va? — Vb), 

a? — Jab + b? > a? — 2\/a2b? + 52, 
2 arb? > 2ab, 

Jab} > ab. 

Suppose a? < 6? and ab >0. Then 


min {a?, b?} = a? = |a|-|a| <|a|-(b| = ab. 


If a@>0, then Va? =a; if a<0, then Va? = —a>0. Similarly, 


Va? is 0if a =0, and otherwise it is the positive member of the set 
{a, —a}; \/a? = max {a, —a}; Va® = {a, —a}+; the graph of 
y = Vx® is shown in Fig. 3.6; and \/a? = asgna. 
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Chapter 4 
Page 49 
. (a) 4,5; (b) 6, 7.5; (c) 6, 6.5; (d) 0, 10. 
. (a) 3p, Sp; (b) 0, p/2; (c) 2p, p? + 1. 


Pages 51-52 
. a+ 6 = diameter = 27, so r = (a + 5)/2; by similar triangles, a/h = 
h/b, so h = vfab. 


. To show that the harmonic mean is less than or equal to the geometric 
mean, an intermediate step is 


ab(a ~ by? > 0; 
or see Exercise 1 on page 21. The sign of equality holds if and only if 
a = b. Toshow that the harmonic mean is less than or equal to the arith- 


metic mean, you can now use the arithmetic-mean-geometric-mean 
inequality or can proceed directly, with intermediate step 


(a — 6)? > 0. 
(a) 3.2, 4,5; (b) 4.8, 6, 7.5; (c) 5.547, 6, 6.5; (d) 5.83*, 5.91*, 6; (€) 6, 6, 6. 
4. Half the distance at each rate: 


» 


oe pepe ee =e a =4(1,1 
3 = it = rele, a= ri’ a= 3, Pah +hRa5 Pree s 
a _ td i i a 2 oa. 2rire 

a 2N\rn re . "=(/n)+U/n) nth 


Half the time at each rate: 


d= rat Fat =_ath 
lec Tala ee aa 


By the result of Exercise 2, half the time at each rate would get you there 
sooner, , 


. Let b= I/a. 


Pages 61-62 


. In the arithmetic-mean-geometric-mean inequality (4.19) set the first 
of the numbers a; equal to the same value yy, set the next me of the num- 
bers a, equal to the same number ye, and set the last m;, of the numbers 
a, equal to the same value y,, and observe that 

m+ mete + man. 
This gives the first inequality. For the second inequality, set 


m, me mM. 
—oPr —£= pf sihee St=rn, 
n 1» n 29 ’ k 
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2. 8.5,9.1+; 0.5,0.7*; p, p. 

3. Intermediate step: (a — b)? > 0; or see Exercise } on page 24. Equality 
holds if and only if a = 6. Since the root-mean-square is greater than 
or equal to the arithmetic mean, it is also greater than or equal to the 
geometric and harmonic means. 

4. (a) A diagonal divides GH into two segments, one of length a/2, the other 


of length 6/2. 
(b) If ABLK ~ KLDC, then AB/KL = KL/CD and KL? = \/ab. 


(c) The harmonic mean is 


To show that EF =h, prove that EO = OF and use similar triangles: 
EO _ AE _ AC-—EC_,_ EC 


CD AC AC AC’ 
But 
EC _ EO 
AC AB’ 
ees EO EO 1 1 
Bo =1-22 of OC get cp)e I 
thus 
~AB+CD__ab _1 
CCR ECD Gib 2 
ane EF =2EO =i: 


(d) Set MN =r, let x and y be the altitudes of the constituent trape- 
zoids so that x + y is the altitude of ABDC. Then, by hypothesis, 


rta.xalatb 4), FHP yap at bint yy, 


2 2 
This system of simultaneous linear equations in x and y has a solution if 
and only if 


rit bh, 


hence r is the root-mean-square of a and b. 
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Chapter 5 
Page 87 


1. Add together ab + bc + ca < a? + b? + c?, from Exercise 3 on page 
22, and 2(ab + be + ca) = 2ab + be + ca), to get 3(ab + be +a) < 
(a+6+c)*, whence 


_ 2 2. 2(E\? _ E2 
A = Yab + be + ca) <2 @+b+ eR = 2(2) ==. 


Equality in Exercise 3 on page 22 if and only if a=b=c. 

2. If the surface area A of a rectangular box is a fixed value A, then the sum 
of the lengths of the twelve edges is at least 2\/6A, and the box is a 
cube if and only if E = 2\/6A. The proof is immediate from the in- 
equality A < E2/24 of Exercise 1. 

3. Let A = area, P = perimeter, a and b = lengths of sides, c = length of 
diagonal. By inequality between arithmetic mean and root-mean-square 
(Exercise 3 on page 62), 


2 2 
P=4a+b)=4(242) <4 je a =2yTe. 


By inequality between geometric mean and arithmetic mean, 
es a+ by _ P? 
A= ab <(4$8)' =F. 


Hence 


The signs of equality hold if and only if a = 6. Hence the square has 
the greatest perimeter (2,\/2c) and the greatest area (c?/2). 


Pages 89-90 
lL. If y=z, thn s—y=s—z=x/2, so 


1? = ss — xXs — ys — 2) = 8 — x) FF = 


If x=y=z, then s—x=s—y=s—z=5/3, so 
E? = ss — x\(s — ys — 2) =F. 


2. Substitute for E?, J?, and A? from Exercise 1, then multiply out and com- 
pare terms. 


3. Each factor on the right is nonnegative, either because it is a square or 
because of its geometric meaning. Have E? = J? ifand only if x = 2s/3, 
ie., if and only if the isosceles triangle actually is equilateral. Have 
i? = A? if and only ifeither x =s or y =2, ie, if and only if the 
general triangle actually is isosceles. 
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4. The second inequality. See the discussion of J? = A? in the above 
solution of Exercise 3. 


5. The first inequality. See the discussion of E? = J? in the above solu- 
tion of Exercise 3. 


6. The formula shows that the square of the area of the triangle is equal to 
the square of the area of the equilateral triangle having the same perime- 
ter, diminished by a certain amount if the triangle is only isosceles, and 
diminished still more if the triangle is not even isosceles. 

7. Let a and 6 = lengths of sides, c = length of hypotenuse, f = length of 
altitude. Then by similar triangles, ¢ = ab/c. By the arithmetic-mean- 
geometric-mean inequality, 

a - P+? _ 
ec 7 2 


’ 


ue 


with equality if and only if a = 5, i.e., if and only if the right triangle 
is isosceles. 


Pages 97-98 
1. (5, —3) 
2. +2V%; -2VB<y <i vB 
= 42s 
ag tS 


4. If y is expressed in terms of x from the linear equation, and the resulting 
expression is substituted in the quadratic equation, then its discriminant 
is 4a2b2n2(q2m? + b2n? — k?); this expression vanishes if 


k= + Vaim? + Bent. 


The corresponding double roots are given on page 96, eqs. 5.23. 


Chapter 6 
Page 111 


Pie 
XY 


The city-block length of the diameter is 2 and that of each side is 
d,(P;, Po) = d\(O, Q) = 2. Hence the city-block length of the perimeter 
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is (442) = 8, and the desired ratio is 


The non-Euclidean length of the diameter is 2 and that of each side is 
d,(P,P2) = d,(OQ) = 2. Thus, the non-Euclidean length of the perime- 
ter is 8 and 


hae Se. 


3. Hint. Each regular polygon with an even number of sides and with 
center at the origin can be placed in two different positions, each sym- 
metric with respect to each coordinate axis. Try to prove that 
(a) if the number AN of sides of the polygon is divisible by 4, then the 
non-Euclidean length of each side is 2 tan (180°/N) [and hence the 
perimeter has non-Euclidean length 2N tan (180°/N)], 

(b) if the number N of sides is even but not divisible by 4, then the 
length of each side is 2 sin (180°/N) [and hence the perimeter is 
2N sin (180°/N)}, 
(c) the results (a) and (b) are valid for all possible positions of the 
polygon. 

Since N = 8 is divisible by 4, we have 


diameter = 2, 
non-Euclidean length of perimeter = 16 tan (180°/8) 


noi 


16 tan 22.5°, 
fe 6 tan 22.2" = 8 V2 —1) 
= 3.314. 


4. Since N = 10 is not divisible by 4, we have 


diameter = 2, 
non-Euclidean length of perimeter = 20 sin (180°/ 10), 


7 20sin 18" 10 sin 18° = 3.090. 
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